COLORECTAL CANCER ANTIGEN

FIELD OF THE INVENTION 

The present invention relates to a novel, diagnostic antigen for colorectal cancer, uses 
thereof, and especially the use thereof in immunotherapeutic treatments for colorectal cancer. 

BACKGROUND OF THE INVENTION 

Colon cancer is a leading cause of mortality in Western countries. Despite the 
improvement of surgery and chemotherapy treatments, the five-year survival rate has not 
significantly altered over several decades (1, 2). Immunological therapies have been intensively 
investigated in patients with melanoma, where treatment with IL-2, as well as the adoptive
transfer of in vitro cultured tumour infiltrating lymphocytes (TIL), has been found to result in
cancer regression in a significant percentage of patients (3, 4). 

In contrast, immunotherapy has not provided a benefit to colorectal cancer patients, which 
may be due to the poor immunological characterization of this cancer, limiting the treatment 
options for patients with this disease (5, 6). The presence of a CD8+ T cell infiltrate in colon
cancer has prognostic value (7); nevertheless, the presence of an inflammatory infiltrate was not 
linked to systemic immunity against cancer in this report. The loss of HLA class I expression 
both in vitro and in vivo has frequently been described in colorectal cancers, and appears to be 
associated with tumour progression (8-10). 

The limited availability of in vitro established tumour lines and specific T lymphocytes 
has in addition hindered analysis of the role of the immune system in colorectal cancer. Although 
a large number of tumour associated antigens (TAA) have been identified, the majority of these 
are either limited in their expression to melanoma or are expressed in melanoma as well as in a 
number of other histologies, including breast, ovarian, lung and prostate tumours (11).

Candidate antigens that appear to be over-expressed in colon cancer, such as 
carcinoembryonic antigen (CEA), the epithelial cell adhesion molecule EP-CAM, HER-2/neu, 
and cyclophilin B, have been evaluated as potential targets for colorectal cancer therapy by
carrying out in vitro sensitisations of PBMC with candidate peptides from these molecules that
bind to particular HLA alleles.

However, only a relatively small number of potential epitopes have been identified using
this approach, and the T cells generated using many of these peptides did not
efficiently recognise native, unmanipulated tumour cells (12-15).

We have now identified a new tumour associated antigen for colorectal cancer that is 
capable of eliciting a T cell-mediated immune response. 



SUMMARY OF THE INVENTION 

Thus, in a first aspect, the present invention provides a method for stimulating immunity 
against colorectal cancer, comprising stimulating the production of antibodies against the human 
homologue of the Socius gene product, wherein the alanine residue at position 399 is substituted 
by a valine residue. The coding sequence, and the transcript thereof, for the colorectal antigen 
COA-1 are preferably as shown in SEQ ID NO 1, which shows the relationship of genetic 
sequence with the colorectal antigen COA-1 transcript (also shown in Figure 5), and which has 
alanine at position 399. It is this antigen against which an immune reaction can be raised in 
accordance with the present invention. 

More specifically, there is provided the use of a peptide comprising all or an 
immunogenic part of the amino acid sequence designated SEQ ID NO 6 in the manufacture of a 
vaccine to stimulate an immune response against COA-1. The immunogenic part of the 
sequence is referred to herein as the epitopic portion of the sequence, and is sufficient to 
establish a response against COA-1, either as the isolated portion of the sequence, or in the 
context of any surrounding amino acid sequence(s) forming part of a longer sequence. 

In particular, the immunogenic part of the sequence is sufficient, when administered in
the form of a vaccine, to stimulate an immune response, particularly through the maturation of T 
cells. 

The human homologue of the rat Socius gene product, as expressed in non-cancerous 
cells, also comprises alanine at the position corresponding to 399, although it has an extra 75 
amino acid residues compared to the newly discovered COA-1 protein. The COA-1 protein has
been shown to have either a valine or an alanine at position 399, the latter appearing to be
associated with expression by cancerous cells, especially colorectal cancerous cells. Without 
being bound by theory, it appears that the presence of alanine at position 399 of the COA-1 
protein is diagnostic, or at least indicative, of cancer in the tissue expressing it, at least where the 
tissue is colorectal. 

The nucleotide sequence of the human homologue of the Socius gene and its gene 
product are shown in SEQ ID NOS. 19 and 20 respectively. 

However, what is particularly surprising is that it has been established that an epitope 
located between amino acids 372 and 385, inclusive, of the COA-1 transcript is responsible for 
stimulating immunity against the tumour variant of the protein, and that it is not necessary for the
immunising peptide to comprise the mutation at position 399. 

The immunising peptide comprises an epitopic portion of the peptide 
TLYQDDTLTLQAAG (SEQ ID NO. 6). This sequence may be supplemented with additional 
sequences at either end, up to and including the entire remaining sequences of COA-1, and even 
additional sequences beyond that, if desired, such as might be encountered with a fusion protein, 
for example. As demonstrated herein, more specific supplemental sequences, including FSTFPP
(SEQ ID NO. 9) at the N-terminus and/or LVPKAA (SEQ ID NO. 10) at the C-terminus both 
permit stimulation. It will be appreciated that, in general, an epitope need not be as long as 14 
amino acids, and that a deletion of a few amino acid residues from either end of the epitope may 
still serve to produce immunity. 

Thus, the present invention contemplates a peptide sequence comprising an epitopic 
portion of SEQ ID NO. 6. The epitopic portion preferably consists of 8 or more, and preferably 
10 or more, contiguous amino acid residues from SEQ ID NO. 6. Where they are part of a 
longer peptide or other molecule, then the epitopic portion is preferably either suitably exposed 
to be able to stimulate an immune response, or is presented in such a manner as to be processable 
to achieve such stimulation when presented to the host's immune system. In this respect, it is 
generally not desirable to use full length COA-1 protein, mutated into the cancerous form, or 
otherwise, as the epitope can be cryptic, in this form. 
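
The fragment definition above (8 or more, and preferably 10 or more, contiguous residues of SEQ ID NO. 6) can be illustrated by a simple enumeration. The following Python sketch is illustrative only: the function name `contiguous_fragments` is hypothetical, and the listing does not indicate which fragments are actually immunogenic, which can only be established experimentally.

```python
# Illustrative sketch: enumerate contiguous fragments of SEQ ID NO. 6.
SEQ_ID_NO_6 = "TLYQDDTLTLQAAG"  # 14-residue epitope sequence given in the text


def contiguous_fragments(peptide, min_len):
    """Return every contiguous fragment of `peptide` with length >= min_len."""
    fragments = []
    for length in range(min_len, len(peptide) + 1):
        for start in range(len(peptide) - length + 1):
            fragments.append(peptide[start:start + length])
    return fragments


candidates_8 = contiguous_fragments(SEQ_ID_NO_6, 8)    # "8 or more" residues
candidates_10 = contiguous_fragments(SEQ_ID_NO_6, 10)  # "10 or more" residues
print(len(candidates_8), len(candidates_10))
```

Each candidate is a plain substring of the 14-mer, so the full-length sequence itself is always the last entry of either list.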

It has also been established that the epitope is preferentially expressed by antigen
presenting cells in association with the alleles HLA DRβ1*0402 or HLA DRβ1*1301. It will be
appreciated that these sympathetic alleles are not necessarily the only HLA alleles able to
stimulate immunity to COA-1, and that the present invention extends to other sympathetic
alleles. Preferably, the epitope is preferentially expressed by antigen presenting cells in association
with either or both of the HLA DRβ1*0402 and HLA DRβ1*1301 alleles.

Sympathetic HLA-II alleles are not necessarily present in all members of the human
population but, where an individual has PBMC's (peripheral blood mononuclear cells) either 
autologous or allogeneic for either of these alleles, then it is sufficient simply to provide a 
vaccine comprising the immunising part of COA-1. 

The immunising portion of COA-1 may be as much as the entire molecule, either with or 
without the mutation at position 399 but, more preferably, it simply comprises a peptide 
comprising at least the immunising epitope located between position 372 and 385 of the COA-1 
transcript. The invention further extends to the sequence between 371 and 384, inclusive, of 
COA-1 as an epitope, as well as to the sequence 371 to 385, inclusive, and 372 to 384, inclusive. 

The immunising epitope may be presented in any suitable form. At its simplest, a 
vaccine comprising the peptide and a suitable carrier may be provided, together with, if required, 
any suitable excipients and/or adjuvants, for example. 

The immunogenic peptide may also be presented in the form of nucleic acid in a form
suitable for expression in the patient, either in a host organism, such as an attenuated virus, in a 
vaccine, or in the form of a suitable expression vector for expression in vivo. 

It will be appreciated that the present invention extends to the sequence for COA-1, as 
well as the transcription product thereof. The invention further extends to the COA-1 sequence 
lacking one or more introns. The sequence of the invention may also lack one or more exons, 
provided that the immunising epitope provided between amino acids 372 and 385 of the wild 
type transcript is encoded. It is not necessary for the amino acid substitution at position 399 to 
be encoded, and it is generally preferred that this substitution is not encoded by the nucleotide 
sequences of the present invention. Without being bound by theory, it is possible that this 
substitution in the sequence of normal cells could affect the processing of the antigen, leading to 
a lack of expression of the immunogenic epitope. It will be appreciated that the degeneracy of 
the genetic code allows the nucleotide sequence to vary widely and still encode the immunogenic 
sequence, but it is generally preferred to use the wild-type sequence, for simplicity, unless it is 
desired to engineer a splice site, for example. 
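
The extent of the degeneracy referred to above can be made concrete by counting how many distinct nucleotide sequences encode the 14-residue epitope of SEQ ID NO. 6 under the standard genetic code. The Python sketch below is illustrative only; the names `CODON_COUNT` and `count_encodings` are introduced here and do not appear elsewhere in this specification.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (stop codons excluded; the counts sum to 61 sense codons).
CODON_COUNT = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_encodings(peptide):
    """Number of distinct coding sequences for `peptide` (one-letter code)."""
    return prod(CODON_COUNT[residue] for residue in peptide)


epitope_encodings = count_encodings("TLYQDDTLTLQAAG")  # SEQ ID NO. 6
print(epitope_encodings)
```

The count grows multiplicatively with peptide length, which is why the wild-type sequence is the simplest default choice among the many equivalent coding sequences.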

Where the patient does not express a sympathetic HLA-II allele, then immunity may be
conferred in a number of ways, any of which may also be employed in patients expressing a 
sympathetic allele. 

Sympathetic alleles are expressed by PBMC's, such as B cells and fibroblasts. Thus, in 
one aspect, it is sufficient to isolate PBMC's or their progenitors from the patient and to 
transform these cells with HLA DRβ1*1301 or HLA DRβ1*0402 alleles, for example. Once
successful transformation has been achieved, then the PBMC's, whether directly transformed, or 
whether obtained from the progenitors, may be used to stimulate the appropriate immunity, after 
reintroduction into the patient. This may be achieved either by introducing the PBMC's into the 
patient, followed by administration of a vaccine as described above, or the PBMC's may be 
contacted with COA-1, or a precursor therefor, or the immunising epitope or precursor therefor 
and, preferably once there has been some opportunity for endocytosis to occur, the treated 
PBMC's are administered to the patient. It will be appreciated that, in these circumstances, a 
"precursor" may include, for example, a fusion protein or a nucleic acid suitable for expression 
in the PBMC culture. 

It will also be appreciated that suitable PBMC's may be obtained from, for example, a 
universal donor, and an immunising preparation may be made from such cells in a manner 
similar to that described above for transformed cells from the patients themselves. 

It will be appreciated that the present invention extends to vaccines and immunising 
preparations as described above, as well as to host cells expressing COA-1, or a precursor 
therefor, provided that the immunising epitope is comprised in the transcript expressed thereby. 

It will also be appreciated that the present invention extends to the use of antibodies 
recognising COA-1 having alanine at position 399. Such antibodies may be used as a passive 
vaccine, for example, or may be used in diagnostic assays for colorectal cancer. Such assays may
take the form of ELISA assays, for example, or may be used in suitable immunoblotting 
techniques. 

The invention extends to the COA-1 protein, and especially to fragments thereof 
comprising an epitopic sequence, as defined above. Such fragments may further comprise 
additional amino acid residues up to and including alanine at position 399 of SEQ ID NO. 1, and
include such fragments where residues between the epitope and position 399 are conservatively
substituted, or there are one or more deletions, insertions and/or inversions that do not block the
antigenicity of the epitope.

The invention further provides a vaccine comprising a peptide of the invention and
PBMC's expressing a sympathetic allele therefor, preferably an MHC Class II allele. 

Thus, COA-1 is thought to be an immunodominant antigen mediating an anti-tumour 
immune response in Colorectal Cancer (CRC) patients. COA-1 is, therefore, thought to be 
useful as an immunogenic antigen for mediating anti-tumour immune responses in CRC
patients, the response preferably correlating with the progression of the disease. Thus, it is also 
thought to be useful in the provision of immunotherapy protocols, such as peptide vaccination or 
adoptive transfer of antigen specific T cells for CRC, as well as being a useful marker for the 
prognosis of the disease. 

Preferably, the peptide is an oligopeptide, preferably having 50% or less of the amino 
acid sequence of COA-1, preferably 40% or less, preferably 30% or less, preferably 20% or less, 
and most preferably 10% or less. 
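
The percentage limits above can be translated into residue counts. The Python sketch below is illustrative only and assumes that "the amino acid sequence of COA-1" refers to the 437 amino acid product of the longest open reading frame reported in the Detailed Description; the names `COA1_LENGTH` and `max_residues` are hypothetical.

```python
# Illustrative sketch: translate the percentage limits into residue counts.
COA1_LENGTH = 437           # assumption: length of the longest reported ORF product
EPITOPE = "TLYQDDTLTLQAAG"  # SEQ ID NO. 6


def max_residues(percent, full_length=COA1_LENGTH):
    """Largest whole number of residues not exceeding `percent` of the sequence."""
    return (full_length * percent) // 100


limits = {percent: max_residues(percent) for percent in (50, 40, 30, 20, 10)}
for percent, residues in limits.items():
    print(f"{percent}% or less: at most {residues} residues")

# Share of the full sequence taken up by the 14-residue epitope alone.
epitope_share = 100 * len(EPITOPE) / COA1_LENGTH
print(f"epitope alone: {epitope_share:.1f}% of the sequence")
```

Under this assumption, the 14-residue epitope fits comfortably within even the narrowest (10% or less) limit.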

Preferably, the peptide comprises the amino acid sequence designated SEQ ID NO 6, and
raises an immunogenic response by administration thereof. Preferably, the peptide elicits a
CD4+ T cell response in an individual.

We have also found that the peptide raises an immunogenic response in melanoma cells. 
Therefore, it is also preferred that the immune response is stimulated against melanoma cells. 

DESCRIPTION OF THE DRAWINGS 

In the following Example reference is made to the accompanying Figures, in which: 

Figure 1 shows a phenotypic characterization of the colorectal cancer line 1869 col. 

Figure 1A shows staining of the 1869 col cell line using antibodies directed against MHC class
I (W6/32) and class II (L243) molecules, an epithelium marker (Ber-EP4), and the β subunit of
prolyl-4-hydroxylase (5B5), a protein expressed exclusively in fibroblasts.

Figure 1B shows intracellular staining carried out using three cytokeratin reactive
monoclonal antibodies: CK18, which reacts with cytokeratin 18; LP34, which reacts with
multiple cytokeratins; and MNF116, which reacts with cytokeratins 5, 6, 8, 17 and probably 19.

Figure 1C shows staining of 1869 col cells at passage 6 (P6) and passage 20 (P20), 
carried out with the anti-CEA monoclonal antibody Col-1.

Figure 2 shows a cDNA clone isolated from the 1869 cDNA library encoding an
antigen recognised by C111 T cells.

The 293 cells expressing the MHC DRβ1*0402 or 1301 molecules were transfected with
the 1D8 cDNA clone, or COA-1a, which corresponds to nucleotides 209-1318 of the COA-1
gene (see Figure 3).

Target cells were either transfected with the COA-1a product alone or were
co-transfected with a mixture of COA-1a and the full length HLA class II invariant chain (Ii).
Additional targets were transfected with a control plasmid encoding GFP. Eighteen hours
following the addition of 5x10^4 C111 T cells to the transfectants, supernatants were collected and
IFN-γ release was measured by ELISA.

Figure 3 provides the sequence of the COA-1 gene (SEQ ID NO. 1) isolated from the 
mRNA of the tumour line 1869 col. 

The COA-1 gene was isolated by RT-PCR from the 1869 col tumour cell line. The 
amino acid sequence of the 1D8 cDNA clone (SEQ ID NO. 12) is shown in bold letters. The 
amino acid sequence corresponding to the T cell epitope (SEQ ID NO. 6) is underlined, and the
single nucleotide difference between the normal and tumour transcripts at position 1280 is noted. 

Figure 4 shows that the COA-1 transcript derived from normal B cells is not 
recognised by the clone C111 T cells.

293 cells expressing the indicated MHC DRβ1 molecules were transfected with COA-1a
cDNAs isolated by RT-PCR from either the 1869 col cell line or from 1869 CD40L stimulated B
cells. The GFP and Ii-1D8 constructs were used as negative and positive controls, respectively.
Eighteen hours following the addition of 5x10^4 C111 T cells to the transfectants, supernatants
were collected and IFN-γ release was measured by ELISA. Dark shading represents
293-DR*1301. Hatched shading represents 293-DR*0402.

Figure 5 shows the relationship of the genetic sequence of COA-1 to the transcript. 

The nucleotide sequence of COA-1 is shown in relationship to the protein sequence. The 
gCc triplet, comprising C at nucleotide position 1280, encodes alanine. The amino acid sequence
(SEQ ID NO. 2) of the longest open reading frame in this transcript, which is similar to the 
Socius gene product (20), is noted beneath the nucleotide sequence. 
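
The effect of a single nucleotide difference in this triplet can be checked against the standard genetic code. The Python sketch below is illustrative only: the helper `substitute_middle_base` and the partial codon table are introduced here, and the identity of the variant base (T) is inferred from the valine-encoding variant described elsewhere in this specification, since every alanine codon is GCN and every valine codon is GTN.

```python
# Subset of the standard genetic code: all alanine (GCN) and valine (GTN) codons.
CODON_TABLE = {
    "GCT": "Ala", "GCC": "Ala", "GCA": "Ala", "GCG": "Ala",
    "GTT": "Val", "GTC": "Val", "GTA": "Val", "GTG": "Val",
}


def substitute_middle_base(codon, new_base):
    """Return `codon` with its second (middle) base replaced by `new_base`."""
    return codon[0] + new_base + codon[2]


tumour_codon = "GCC"  # the gCc triplet, with C at nucleotide position 1280
variant_codon = substitute_middle_base(tumour_codon, "T")  # inferred variant
print(tumour_codon, CODON_TABLE[tumour_codon])
print(variant_codon, CODON_TABLE[variant_codon])
```

Because the third base is unchanged, a C to T change at the middle position is the only single-base route from this alanine codon to a valine codon.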



WO 2005/039632 






PCT/EP2004/012087 



DETAILED DESCRIPTION OF THE INVENTION 

Several tumour reactive CD4+ T lymphocytes were isolated from PBMC and TIL that
were obtained following the establishment of autologous cultured colon tumour cell lines. These
studies focused on a single clone of CD4+ T cells, C111, that responded strongly to autologous
tumour cells, and demonstrated low but significant reactivity with autologous EBV B cells, but
failed to respond to autologous CD40L stimulated B cells. The gene encoding the antigen
recognised by this clone, termed COA-1, was isolated by screening an autologous cDNA library
with clone C111 T cells.
This gene appeared to be nearly identical to the gene encoding the human homologue of the rat 
Socius protein that was recently cloned using a yeast two-hybrid screening assay in which a 
member of the Rnd family of GTPases was used as bait (20). The Socius product was expressed 
at high levels in rat testis, but was expressed at significantly lower levels in rat lung, thymus and 
brain. 

The longest open reading frame in the COA-1 transcript encodes a 437 amino acid 
product that corresponds to a portion of the human Socius gene product, and two overlapping 
peptides derived from this open reading frame were identified that could sensitise target cells 
expressing either HLA-DRβ1*0402 or 1301. The stimulation observed with peptide pulsed
targets was weak relative to that seen with the tumour cell lines that were recognised, and a
minimum concentration of approximately 10 µM was needed to stimulate significant cytokine
release from C111 T cells (Table 4).

Peptides derived from non-mutated tumour antigens such as tyrosinase (23) and TRP-1 or 
TRP-2 (17) have also been found to stimulate only relatively low levels of cytokine release from 
HLA class II-restricted, tumour reactive T cells, and minimal concentrations of between 1 and 10
µM of the peptides identified in these studies were required to sensitise target cells for T cell
recognition. This may reflect the fact that these represent non-mutated self antigens, and that 
self tolerance results in the deletion of T cells that recognise peptides that bind to class II 
molecules with high affinity. 

In addition, the autologous tumour cell line should present this peptide in the context of 
both the HLA-DRβ1*0402 and 1301 restriction elements, leading to enhanced stimulation of T
cells reactive with this epitope. Transfectants expressing the COA-1 product stimulated
significantly less cytokine release from C111 T cells than the autologous tumour cell line that
had been induced to express high levels of HLA class II molecules. One potential explanation
for this observation, however, is that the HLA class II positive 293 cells used as targets for 
transfection of the COA-1 gene products fail to express optimal levels of accessory molecules 
associated with the processing of this epitope. 

The COA-1 transcript is nearly identical to sequences derived from a variety of tissues 
and tumour cell lines. These transcripts, however, comprise a large array of over 20 alternatively 
spliced products that are derived from at least 15 exons residing at the chromosome 1p36.1-p35
locus. The COA-1 product expressed in colon tumour cell lines appeared to contain a unique
splicing pattern that did not correspond to any of the transcripts identified in the EST and
GenBank databases, which may not encode products recognised by C111 T cells. Two nearly
identical COA-1 gene products were amplified from EBV B cells, one of which was identical to
that isolated from the colon tumour cells, and a second that contained a single nucleotide
alteration at position 1280 that resulted in a substitution of a valine residue for the alanine
residue at position 399 encoded by the dominant colon tumour cell product. It is not clear why
C111 T cells only appeared to weakly recognise EBV B cells expressing the appropriate HLA
class II gene products, but these observations could result from inherent differences in the 
antigen processing abilities of colon tumour cells and EBV transformed B cells. 

Previous results have suggested that differences in the proteasomal subunits expressed by
various cells may significantly influence antigen recognition, which provides one potential 
explanation for this finding (24). The RT-PCR products that were amplified from normal B cells 
and fibroblasts also appeared to uniquely encode the COA-1 variant that expressed a valine 
residue at amino acid 399, and target cells that were transfected with the COA-1 product that was 
amplified from normal cell lines were not recognised by C111 T cells.

Thus, it appears that normal B cells and fibroblasts either fail to express the COA-1 
transcript that can be processed and presented to C111 T cells or express this product at only
relatively low levels. The mechanisms involved in the preferential expression of these two
transcripts are unknown, but these may represent the products of two nearly identical genes
whose expression is differentially regulated. The correlation between expression of these
products and the ability of C111 T cells to recognise the epitope encoded by these products
provides further evidence that this represents the natural product recognised by these T cells and
not a peptide mimic of the natural epitope.

An additional question, further discussed below, is how the alteration at
position 399 affects recognition of the T cell epitope comprised of amino acids 372 to 385 of the
COA-1 transcript. Results of a previous study indicated that alteration of a distal residue can
influence the ability of tumour reactive CD4+ T cells to recognise a mutated product of the
CDC-27 gene (21). Preliminary results presented in the prior study indicated that altered
intracellular targeting of the mutated CDC-27 gene product may have played an important role in 
influencing processing of this gene product. Investigation of the cellular localization of the 
COA-1 protein in normal and tumour cells may help to indicate whether a similar mechanism 
may be involved with T cell recognition of this product. 

Transfection studies, as well as peptide pulsing experiments, indicated that either of the 
autologous HLA-DRβ1 alleles, DRβ1*0402 or DRβ1*1301, could present the T cell epitope to
clone C111 T cells, which may potentially enhance the immunogenicity of this peptide in patient
1869 as well as other individuals that express these class II alleles. This observation is not
unique, however, as examples of promiscuous recognition of class II as well as class I restricted
epitopes have been noted in previous studies. In one report, CD4+ T cells were identified that
also recognised an epitope of the herpes simplex type 2 virus virion protein VP16 in the context
of DRβ1*0402, 1102 or 1301 but not several closely related DR4, 11 or 13 subtypes (25). The
sequences of the DRβ1*0402, 1102 and 1301 molecules are identical in a polymorphic region
between amino acids 67 and 71, and site directed mutagenesis studies demonstrated that these 
residues were critical for the recognition of the viral epitope. 

High levels of lymphocyte infiltration into tumours have been shown in some studies to 
be correlated with a good prognosis (26), but detailed investigations of the reactivity of 
infiltrating T cells have not been carried out. The expression of HLA class II molecules on 
colorectal cancer cells is also a favourable prognostic marker (27) (28). Previous studies resulted 
in the isolation of HLA class I (29) and class II (30, 31) restricted tumour reactive T cells from 
colon cancer patients, but only a limited panel of shared tumour specific antigens was identified
in these studies.

Peripheral blood lymphocytes isolated from CRC (colorectal cancer) patients were
stimulated in vitro with the COA-1 derived epitope, and tumour reactivity was verified.
Tumour-specific CD4+ T cells were isolated from 3 patients with progressive disease, although a
single failure to generate COA-1 specific T cells was observed in a CRC patient (no. 4) with an
early stage tumour.

In collaboration with the clinical centre of the Fatebenefratelli Hospital, Rome, peripheral 
blood samples from CRC patients have been collected to confirm whether an immune response 
directed to COA-1 is commonly detectable in a large number of patients expressing specific 
MHC class II molecules and with metastatic disease. These studies seek to demonstrate that
COA-1 is a relevant antigen for the anti-tumour immune response in CRC patients, correlating
with the progression of the disease.

In addition, we have also shown that COA-1-specific reactivity could be isolated from
PBMCs of CRC patients using professional antigen presenting cells, dendritic cells (DC) loaded 
with a tumour expressed antigen array. DC were generated, in the presence of GM-CSF and
IFN-alpha, from monocytes of one CRC patient (anti-COA-1 T cells were previously isolated from
the same patient, in the Example), loaded with autologous CRC line-derived lysate and used for 
in vitro stimulation of PBMCs. 

After three stimulations, both anti-COA-1 and tumour reactive T cells were isolated.
Tumour reactive and COA-1 specific CD4+ T cells could be isolated from the same CRC patient
by in vitro stimulation of PBMCs either with intact tumour cells or with DC pulsed with
tumour lysate. These results indicate that COA-1 can represent an immunodominant antigen 
mediating an anti-tumour immune response in CRC patients. 

COA-1 specific T cells recognised only tumour cells and not normal cells,
though both types of cells express this antigen (see the Example), suggesting that a differential
localization and/or processing of this antigen could occur in malignant and normal cells. To
investigate this issue, a laser scanning confocal microscopy analysis was carried out on a panel 
of normal and tumour cell lines by using a specific polyclonal antibody directed to COA-1. The 
intra-cellular localisation and the translocation pathway to the cell membrane of COA-1 were 
studied.

Localisation of the protein in the cellular cytoplasm was observed both in tumour and in
normal cells, whereas nuclear localization of the protein was found only in CRC and fibroblast
cell lines. Association of the protein with the Golgi apparatus has been selectively detected in
tumour cells and, moreover, co-localization of COA-1 with one of the microtubule components, 
tubulin, occurred only in fibroblasts. 

It is notable that COA-1 was only associated with HLA class II molecules in tumour 
cells. Thus, taken together these results indicate that, with regard to the COA-1 antigen, 
differential localisation and distinct pathways of cellular translocation occurred in normal and 
malignant cells. 

Therefore, we conclude that the differential localisation of the protein could affect the 
HLA molecule-associated presentation of COA-1-derived immunogenic epitopes, resulting in
the antigen's ability to raise a tumour specific immune response. 

The recombinant COA-1 protein has now been synthesised, and this can be used to 
produce specific antibodies, including monoclonal antibodies. In addition, a multimeric
immunogenic peptide, a complex of multiple chains of the COA-1-derived epitope, has been
synthesized and used to produce antibodies specific for the epitope of COA-1 that can raise an 
immune response. 

These reagents represent useful tools for evaluating the presence of antibodies directed to 
COA-1, or of the protein itself, in the serum of CRC patients. Moreover, this investigation can
be correlated with the follow-up of patients to evaluate COA-1 as a prognostic marker for the
disease. In addition, the newly synthesized anti-COA-1 antibodies can be used to confirm the
results of the analysis of COA-1 cellular localization. 

The invention is not to be limited by what has been particularly shown and described, 
except as indicated by the appended claims. Indeed, while the invention will now be illustrated 
in connection with the following Example, it will be understood that it is not intended to limit the 
invention to these particular embodiments. On the contrary, it is intended to cover all 
alternatives, modifications and equivalents, as may be included within the scope of the invention
as defined by the appended claims.

Example

Material and Methods 

Cell lines and antibodies. 

Colon cancer lines were generated from tumour liver metastases of five patients admitted
to the Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, 
USA. The cell lines were generated from the tumour samples by cutting the tissue into small 
fragments, followed by filtration through sterile gauze. The tumour cells were cultured in 
collagen-coated 6-well plates (Becton Dickinson, Franklin Lakes, NJ) in ACL-4 medium 
(InVitrogen, Carlsbad, CA) containing 10 % foetal bovine serum plus MEGM SingleQuots
(Clonetics, Walkersville, MD) that contained epidermal growth factor (10 ng/ml), insulin (5
µg/ml), hydrocortisone (0.5 µg/ml), gentamicin (50 µg/ml), and amphotericin-B (50 ng/ml).
Fresh medium was added to the cells every 5 days and fibroblasts were depleted from the 
cultures by carrying out a short-term treatment with trypsin. Immunofluorescent staining assays 
to assess cell surface HLA gene expression were carried out using the anti-class I mAb W6/32 
and the anti-DR mAb L243 (Becton Dickinson). 

The cell lines were stained using the mAb BerEP4 (DAKO, Cupertino, CA) that is 
directed against a cell surface molecule whose expression appears to be limited to epithelial 
tissues, and intracellular staining was carried out using the cytokeratin reactive mAbs CK18,
LP34 and MNF116 (DAKO). Analysis of the expression of carcinoembryonic antigen (CEA), a
molecule that is frequently over-expressed in colon tumours, was carried out using the mAb Col-1
(Zymed, South San Francisco, CA). The presence of fibroblasts in the cultured colon tumour 
cell lines was assessed using the mAb 5B5 (DAKO) that is directed against the β subunit of
prolyl-4-hydroxylase, a protein involved with the synthesis of collagen. Flow cytometry was 
carried out using a FACScan (Becton Dickinson). The established colon cancer lines SW1463, 
SW480 and Colo205 were obtained from the American Type Culture Collection (ATCC, Manassas,
Virginia). The melanoma cell line 1681, the fibroblast cell line 1519 and the EBV-transformed 
B cell lines 1869 and 1519 were established in the Surgery Branch and were cultured in RPMI 
plus 10 % FBS. The normal B cell lines 1847, 1681, 1872 and 1869 were generated, as 
previously described (16), by culturing PBL in ISCOVE's medium (InVitrogen) plus 10 % 
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human serum in the presence of 100 IU/ml of CD40L (Immunex, Seattle, WA) and 100 IU/ml of 
recombinant human IL-4 (Pharmingen, San Diego, CA). The MHC class I and class H typing of 
the PBL and of the tumour lines used in this study was determined by single-stranded 
oligonucleotide probe-PCR typing carried out in the NIH HLA typing laboratory, and is 
summarised in Table 1 . Antibodies used to carry out T cell receptor (TCR) analysis were 
obtained from Beckman/Coulter (Miami, FL) or Pierce/Endogen (Rockford, H). 



Identification and characterization of tumour reactive T cells. 

Tumour reactive T lymphocytes were generated from PBMC and tumour infiltrating 
lymphocytes (TIL) derived from colon cancer patients. PBMC were incubated with autologous 
tumour cells that had been irradiated with 150 Gy, at a tumour cell to lymphocyte 
ratio of 1 to 5, in RPMI medium containing 300 IU/ml of recombinant human IL-2 plus 10 % 
human serum (HS). The cultures were stimulated weekly for a period of 5 to 6 weeks with 
autologous irradiated tumour cells. Cultures of TIL were established by initially plating fresh 
uncultured tumours at 5×10^5 cells per well in 24-well plates in RPMI containing 10 % HS and 
1,000 IU/ml of IL-2. Tumour cells used for T cell stimulation were cultured for at least 10 days 
in RPMI containing 10 % HS to avoid the generation of T cells with reactivity against FBS. In 
addition, to optimise or up-regulate the expression of MHC molecules by tumour cells, these 
cells were incubated with IFN-γ (500 IU/ml) for 48 hr. The reactivity of the T cell lines against 
colon cancer lines was examined by incubation of 2×10^4 or, for some of the assays, 5×10^4 T 
cells in flat bottom 96-well plates in the presence of 5×10^4 autologous or allogeneic tumour cells. 
After overnight incubation at 37°C in 5 % CO2, the supernatants were collected and T cell 
responses were evaluated using anti-IFN-γ antibodies (Endogen, Rockford, IL) in a sandwich 
ELISA assay. 

After 3 weeks of culture the T cell lines were cloned by limiting dilution in the presence 
of allogeneic PBMC that had been irradiated with 50 Gy, in RPMI containing 30 ng/ml of 
OKT3 mAb plus 10 % HS. The following day, fresh medium plus rhIL-2 (300 IU/ml) 
was added to the cultures. After two weeks of culture, growth positive wells were screened for 
their ability to release IFN-γ in response to tumour stimulation. The T lymphocytes from 
sensitised PBMC that were chosen for further analysis, C4, C49 and C111, were isolated from 
cultures that were plated at 5 cells per well, but only 27% of the wells were positive for growth 
under these conditions, showing that some or all of these cells represent T cell clones. 

Analysis carried out with antibodies directed against T cell receptor (TCR) families 
showed that greater than 95% of clone C4 T cells expressed a TCR reactive with an anti-Vβ5 
antibody, whereas C49 failed to express TCRs detected by any of the commercial 
antibodies. Amplification of the clone C111 TCR Vβ region product carried out using RT-PCR 
showed that this clone expressed a single sequence derived from the Vβ18 germline gene. Flow 
cytofluorimetric analysis showed that approximately 80% of C111 T cells expressed Vβ18, but 
contaminating feeder cells used to expand the T cell clone may be responsible for the 
discrepancy between these results. Two CD4+ tumour reactive T cell cultures, C5 and C15, 
were also identified from 1869 TIL. These cultures were isolated from cells that were plated at 
one cell per well, and, as only 3% of the wells that were plated were positive for growth, these 
represent T cell clones. In addition, these cultures stained homogeneously with an antibody 
directed against Vβ2, further showing that these represented T cell clones. 
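
The clonality argument made from the fraction of growth-positive wells can be illustrated with a short calculation. The sketch below is not part of the described method: it assumes that clonogenic cells are distributed among the wells according to a Poisson distribution, and the function name `clonal_probability` is hypothetical. Under that assumption, the fraction of positive wells gives the mean number of clonogenic cells per well, and from it the chance that a positive well grew out from a single cell.

```python
import math


def clonal_probability(fraction_positive_wells: float) -> float:
    """Probability that a growth-positive well arose from exactly one
    clonogenic cell, assuming Poisson-distributed seeding (an assumption
    made for this sketch, not a statement from the text)."""
    if not 0.0 < fraction_positive_wells < 1.0:
        raise ValueError("fraction of positive wells must lie between 0 and 1")
    # Fraction of negative wells = exp(-lam), so lam = -ln(1 - positive).
    lam = -math.log(1.0 - fraction_positive_wells)
    # P(exactly one clonogenic cell) under the Poisson model.
    p_exactly_one = lam * math.exp(-lam)
    # Condition on the well being positive (at least one clonogenic cell).
    return p_exactly_one / fraction_positive_wells


if __name__ == "__main__":
    # Positive-well fractions quoted in the text above.
    for label, fraction in (("PBMC-derived, 5 cells/well", 0.27),
                            ("TIL-derived, 1 cell/well", 0.03)):
        print(f"{label}: P(clonal | growth) = {clonal_probability(fraction):.3f}")
```

Under this model the lower the fraction of positive wells, the closer the probability of clonality is to one, which is consistent with the stronger wording used for the TIL-derived cultures.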

Tumour reactive cultures were then expanded in the presence of allogeneic PBL that were 
irradiated with 50 Gy in RPMI containing PHA (1 µg/ml) and IL-2 (300 IU/ml). 
Immunofluorescent analysis of positive cultures was carried out using mAbs directed against 
CD3, CD4, CD8, CD16, and CD56 (Becton Dickinson). Antibody blocking assays were carried 
out by pre-incubating target cells for 1 hour with W6/32, an antibody directed against a 
pan-MHC class I epitope, or L243, a mAb directed against a pan-HLA class II DR epitope. The T 
cells were then added to the target cells, and IFN-γ release was measured following an overnight 
incubation. 

CIITA transduction of tumour lines. 

In order to induce stable expression of cell surface MHC class II molecules, the tumour 
lines 1869 col, SW480, and Colo205 were transduced with a recombinant retrovirus that was 
generated by cloning the gene that encoded the human class II transactivator (CIITA) into the 
retroviral expression vector pCLRCX (17). The transduced 1869 tumour cells were then sorted 
using a FACSVantage™ cell sorter (Becton Dickinson) to obtain cells that homogeneously 
expressed relatively high levels of cell surface HLA class II molecules. 

Isolation of MHC class II DRβ1 molecules. 

The DRβ1*0402 gene was isolated by carrying out an RT-PCR with RNA derived from 
the tumour line 1869 col, and the DRβ1*1301 gene was obtained by carrying out an RT-PCR 
with RNA derived from an autologous T cell line. The primers used to amplify HLA-DR 
were: 5'-TCCAGCATGGTGTGTCTGA-3' (SEQ ID NO 13) and 
5'-CCTTGAATGTGGTCATCT-3' (SEQ ID NO 14). Two additional primers were designed to 
specifically amplify the HLA-DR13 gene product: 5'-CGTTTCTTGGAGTACTCTACGTC-3' 
(SEQ ID NO 15) and 5'-CCACCGCGGCCCGCTCGTCT-3' (SEQ ID NO 16). The isolated 
products were cloned in the plasmid vector pCR-Blunt (Invitrogen, Carlsbad, CA) and 
sequenced using an ABI Prism 310 Genetic Analyser (Perkin-Elmer, Shelton, CT). The genes 
were then cloned in the eukaryotic expression vector pcDNA3.1 (Invitrogen) and the retroviral 
expression vector CLRCX4, discussed above. 
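
As a quick sanity check on primer sequences such as those listed above, length, GC content and a rough melting temperature can be computed. The sketch below is illustrative only: the text does not state how the primers were designed, and the Wallace rule used here (2°C per A/T, 4°C per G/C) is a coarse rule of thumb chosen for this example, not the authors' method. The function names are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer sequence."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(primer: str) -> int:
    """Rough melting temperature in degrees C by the Wallace rule:
    2 * (A + T) + 4 * (G + C). Only a coarse estimate."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    # Primer sequences as given in the text (SEQ ID NO 13 to 16).
    primers = {
        "SEQ ID NO 13": "TCCAGCATGGTGTGTCTGA",
        "SEQ ID NO 14": "CCTTGAATGTGGTCATCT",
        "SEQ ID NO 15": "CGTTTCTTGGAGTACTCTACGTC",
        "SEQ ID NO 16": "CCACCGCGGCCCGCTCGTCT",
    }
    for name, seq in primers.items():
        print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, "
              f"Wallace Tm ~{wallace_tm(seq)} C")
```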

Constructs encoding either of the HLA-DRβ1 genes were co-transfected along with a 
construct encoding the HLA-DRα gene into 293 cells. Stable transfectants were stained with the 
FITC labelled anti-HLA-DR mAb L243, and cells that were strongly positive for the expression 
of cell surface HLA-DR molecules were isolated using a FACSVantage™ cell sorter (Becton 
Dickinson). To induce the expression of molecules involved in HLA class II antigen 
processing, such as the class II invariant chain, DMA, and DMB genes, the 293 cells that had 
been transfected with the HLA-DR constructs were then transduced with recombinant retroviral 
supernatants generated using the CLRC-CIITA construct, as previously described (17). 

cDNA Library Construction and screening. 

Total RNA was extracted from the 1869 col tumour line using TRIzol (GIBCO BRL) and 
poly(A) RNA was then isolated using PolyATtract (Promega, Madison, WI). The poly(A) 
RNA was then converted to cDNA using the SuperScript cDNA Synthesis kit (Invitrogen) and 
cloned in the episomal mammalian expression vector pEAK8 (Edge BioSystems, Gaithersburg, 
MD). The pEAK8 vector had been modified by cloning a fragment encoding amino acids one to 
80 of the human invariant chain (Ii) downstream of the EF-1α promoter in order to express the 
cDNA inserts as fusion constructs and target the gene products to the HLA class II antigen 
presentation pathway. The recombinant cDNA was then electroporated into DH10B 
electrocompetent cells (Invitrogen), and plasmid pools containing approximately 50 cDNA 
recombinants were prepared as previously described (18). The 293 cell lines that were transfected 
with HLA-DRβ1*0402 (293-DR0402) or HLA-DRβ1*1301 (293-DR1301) were transiently 
transfected with DNA prepared from the cDNA pools (200 ng) using Lipofectamine 2000 
(Invitrogen) according to the manufacturer's directions. 

In order to conserve C111 T cells, screening assays were initially carried out by 
transfecting a mixture of 5×10^4 293-DR*0402 and 5×10^4 293-DR*1301 cells with cDNA library 
pools in 96-well flat bottom plates. The following day the cells were washed and 1×10^5 T 
cells in AIM-V medium plus 2 % HS were added to each well. After 18 hrs of incubation at 37°C 
and 5 % CO2, 100 µl of supernatant was collected and the IFN-γ release was evaluated by 
ELISA. For subsequent assays, cDNA pools and clones were transfected into 293 cells that 
expressed only a single HLA-DR allele, and these cells were tested for their ability to stimulate 
C111 T cells. 

5' Rapid amplification of cDNA ends (RACE). 

Total RNA was extracted from the 1869 col tumour cell line and a 5' RACE was 
performed using the SMART RACE cDNA amplification kit according to the manufacturer's 
instructions (Clontech, Franklin Lakes, NJ). The RT-PCR products were cloned into the 
pcDNA3.1 TOPO cloning vector (Invitrogen) and recombinant DNA was prepared for sequence 
analysis. In addition, amplification of the full length COA-1 gene products was carried out using 
the Advantage 2 PCR kit (Clontech). The amplification was carried out by incubation at 95°C 
for 1 minute, followed by 35 amplification cycles consisting of a 30 second incubation at 95°C, a 
30 second annealing step at 62°C, and a 2 minute extension step at 68°C. 
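
The cycling conditions described above can be written down as a small data structure, which also makes it easy to estimate the programmed run time. This is only a sketch: the class and function names are hypothetical, and the estimate ignores ramp times between temperatures and any final extension or hold step, none of which are stated in the text.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Step:
    """One incubation step of a thermal cycling programme."""
    temp_c: int
    seconds: int


# Programme as described in the text: 95 C for 1 minute, then 35 cycles of
# 30 s at 95 C, 30 s at 62 C and 2 min at 68 C.
INITIAL_DENATURATION = Step(temp_c=95, seconds=60)
CYCLE = (Step(95, 30), Step(62, 30), Step(68, 120))
N_CYCLES = 35


def programmed_seconds(initial: Step, cycle: Sequence[Step], n_cycles: int) -> int:
    """Total programmed incubation time, excluding ramping between steps."""
    return initial.seconds + n_cycles * sum(step.seconds for step in cycle)


if __name__ == "__main__":
    total = programmed_seconds(INITIAL_DENATURATION, CYCLE, N_CYCLES)
    print(f"Programmed incubation time: {total} s ({total / 60:.0f} min)")
```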



Identification of T cell epitopes. 

Peptides of 20 or 21 amino acids in length that overlapped by 15 amino acids, and that were 
encoded by the long open reading frame of the original cDNA clone that was isolated, were 
synthesised by a solid-phase method using a peptide synthesiser (AMS 422; Gilson Co., Inc., 
Middleton, WI). The purity of the peptides was verified by mass spectrometry (Tufts Core 
Facility, Boston, MA). Allogeneic B cells (1x10 cells/well) that expressed either the 
DRβ1*0402 or the DRβ1*1301 molecules were incubated with 50 µg/ml of peptide in 100 µl/well of 
Iscove's medium plus 10 % HS in flat bottom 96-well plates. After three hours, 1-5×10^4 T 
cells were added to the wells in 150 µl/well of medium and incubated for 18 hours at 37°C and 
5 % CO2, followed by measurement of IFN-γ release by ELISA. 
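
The design of an overlapping peptide panel of the kind described above can be sketched as a sliding window over the open reading frame. The code below is a minimal illustration under stated assumptions: the function name `tile_peptides` is hypothetical, the example sequence is an arbitrary placeholder and not the COA-1 sequence, and a fixed length of 20 with an overlap of 15 is used, whereas the panel in the text also included 21-mers.

```python
from typing import List


def tile_peptides(protein: str, length: int = 20, overlap: int = 15) -> List[str]:
    """Split a protein sequence into peptides of a fixed length in which
    consecutive peptides share `overlap` residues. If the last regular
    window stops short of the C-terminus, one extra peptide anchored at
    the C-terminus is added so that every residue is covered."""
    if not 0 <= overlap < length:
        raise ValueError("overlap must be non-negative and smaller than length")
    if len(protein) <= length:
        return [protein]
    step = length - overlap
    starts = list(range(0, len(protein) - length + 1, step))
    if starts[-1] + length < len(protein):
        starts.append(len(protein) - length)
    return [protein[start:start + length] for start in starts]


if __name__ == "__main__":
    # Placeholder sequence for illustration only (not from the text).
    example = "ACDEFGHIKLMNPQRSTVWYACDEFGHIKL"
    for peptide in tile_peptides(example):
        print(peptide)
```

With a step of five residues, any epitope of up to 16 residues falls entirely within at least one 20-mer of the panel.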

Generation and characterization of colon cancer lines. 

Cultured colon cancer lines were initially established from liver metastasis specimens 
obtained from five colorectal cancer patients. Analysis of one of the most rapidly proliferating 
cell lines that was obtained, 1869 col, demonstrated that these cells expressed a common 
epithelial marker, expressed cytokeratins associated with epithelial cells (Fig. 1), and maintained 
a morphology in tissue culture that was typical of epithelial cells (data not shown). 

In contrast, the cell lines did not stain with an antibody directed against the β subunit of 
prolyl-4-hydroxylase, a cell surface marker expressed in fibroblasts. Taken together, these 
results indicated that these cells were of epithelial origin and represented colon cancer cell lines 
and did not contain significant numbers of normal cells. The 1869 col cell line expressed 
uniform levels of MHC class I molecules and low or undetectable levels of cell surface MHC 
class II molecules were found on the same cells (Fig. 1), but treatment of the 1869 col cells with 
IFN-y resulted in strong up-regulation of HLA class II expression (data not shown). 

The carcinoembryonic antigen represents a marker that is expressed at high levels in vivo 
on colon tumour cells as well as on many colon tumour cell lines, but is not expressed by 
fibroblasts or hepatic cells. Analysis of 1869 col cells indicated that they expressed CEA (Fig. 
1), and the additional colon tumour cell lines that were generated appeared to express similar 
levels of this gene product (data not shown). An early passage of the 1869 col cell line 
demonstrated high level expression of CEA, and lower but still significant levels of CEA 
expression were observed at later passages of 1869 col cells (Fig. 1). These observations are 
consistent with previous studies in which heterogeneous expression of CEA was observed on a 
variety of colon tumour cell lines (19). 

Isolation and characterization of colon cancer reactive T lymphocytes. 

In the initial attempts to derive colon tumour reactive T cells, tumour infiltrating 
lymphocytes (TIL) from patient 1869 were cultured in high dose IL-2. In addition, autologous 
tumour cells that had been treated with IFN-γ to up-regulate HLA class II gene expression were 
used to carry out in vitro mixed lymphocyte tumour cultures (MLTC) with PBMC from patient 
1869. Three CD4+ tumour reactive T cell clones, C4, C49 and C111, were initially selected for 
further analysis on the basis of their high degree of reactivity with the autologous tumour cell 
line. 

WO 2005/039632 PCT/EP2004/012087 

The three clones derived from PBMC released IFN-γ in response to autologous tumour 
cells that had been treated with IFN-γ, and these clones released significantly higher levels of 
IFN-γ in response to 1869 tumour cells that had been transduced with CIITA and sorted for cells 
that constitutively expressed high levels of cell surface HLA class II molecules (Table 2). 

Relatively low levels of IFN-γ were released by the three T cell clones following stimulation 
with the autologous 1869 EBV B cell line. All of the T cell clones released IFN-γ and 
GM-CSF but not IL-4 following stimulation with HLA class II positive tumour cells (data not 
shown), indicating that they represent cells of the Th1 phenotype. 

In order to test whether the clones isolated from the PBMC recognised tumour cells in an 
MHC-restricted manner, cytokine release assays were carried out in the presence of anti-HLA 
class I and class II specific antibodies using stimulator cells bearing a variety of MHC 
haplotypes (Table 1). The results indicated that the C4, C49 and C111 T cell clones recognised 
the autologous tumour cells in the context of the HLA-DR class II restriction element (Table 2). 
The C49 and C111 T cell clones also recognised the CIITA transduced allogeneic MHC class II 
positive colon cancer lines SW480 and Colo205 that shared expression of HLA-DRβ1*1301 with the 
autologous tumour, and this recognition was blocked by pre-incubation of the tumour cell lines 
with the anti-HLA-DR mAb. 

Generally the responses were inhibited by between 50 and 90% by pre-incubation with 
the anti-HLA-DR antibody, whereas less than 20% inhibition was observed with the anti-HLA 
class I antibody. The response of the C4 clone to the SW480 CIITA tumour cell line, as 
well as the response of C111 to Colo205 CIITA, were only partially inhibited by the anti-HLA-DR 
antibody, which might reflect the fact that these T cells can recognise additional ligands 
other than the classical TCR. The C4, C49 and C111 clones recognised autologous EBV B cells 
as well as an allogeneic EBV B cell line that shared expression of HLA-DRβ1*1301 with 
autologous cells. Normal B cells that were generated by stimulating autologous PBMC with 
CD40 ligand plus IL-4, as well as an allogeneic fibroblast cell line that shared expression of 
HLA-DRβ1*1301 with the 1869 col tumour and that was treated with IFN-γ to up-regulate HLA 
class II gene expression, stimulated little or no cytokine release from these T cells (Table 3). 
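
The degree of antibody blocking quoted above can be reproduced from the cytokine release values as a simple percentage. The sketch below is illustrative: the function name is hypothetical, the calculation assumes that inhibition is expressed relative to the response without antibody, and the IFN-γ values (pg/ml) are those reported in Table 2 for IFN-γ treated 1869 col target cells, as transcribed here.

```python
def percent_inhibition(no_antibody: float, with_antibody: float) -> float:
    """Percentage reduction in cytokine release caused by a blocking
    antibody, relative to the response obtained without antibody."""
    if no_antibody <= 0:
        raise ValueError("the unblocked response must be positive")
    return 100.0 * (no_antibody - with_antibody) / no_antibody


# IFN-gamma release (pg/ml) against 1869 col + IFN-gamma targets, Table 2.
NO_ANTIBODY = {"C4": 234, "C49": 1213, "C111": 536}
ANTI_CLASS_I_W632 = {"C4": 212, "C49": 1100, "C111": 442}
ANTI_DR_L243 = {"C4": 107, "C49": 97, "C111": 17}

if __name__ == "__main__":
    for clone, baseline in NO_ANTIBODY.items():
        class_i = percent_inhibition(baseline, ANTI_CLASS_I_W632[clone])
        anti_dr = percent_inhibition(baseline, ANTI_DR_L243[clone])
        print(f"{clone}: W6/32 {class_i:.0f}% inhibition, "
              f"L243 {anti_dr:.0f}% inhibition")
```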

Two CD4+ T cell clones from TIL 1869 that responded in preliminary assays to 
autologous HLA class II positive tumour cells were also tested for their ability to recognise 
autologous as well as allogeneic colon tumour cell lines. Clones C4, C49 and C111, as well as 
the two clones derived from 1869 TIL, C5 and C15, responded to the allogeneic colon tumour cell 
line 1847 col that shared expression of the HLA-DRβ1*1301 gene product with the autologous 
tumour. In contrast, the allogeneic 1872 col cell line that did not share expression of any HLA-DR 
gene products with the 1869 col tumour failed to stimulate significant cytokine release from 
the T cell clones. 



Identification of the antigen recognised by C111 T cells. 

Further studies aimed at identifying tumour antigens expressed on 1869 col cells focused 
on C111 T cells, which was the only T cell clone that expanded sufficiently to allow the cDNA 
library to be screened. The results of studies carried out with additional tumour histologies 
indicated that C111 T cells did not recognise two allogeneic renal cell lines or a prostate 
tumour cell line that shared expression of HLA-DRβ1*1301 with the 1869 col cell line (data not 
shown). A single allogeneic melanoma cell line that expressed HLA-DRβ1*0402 was identified, 
1681 mel. Cell surface HLA class II expression was up-regulated following treatment of the 
1681 mel cell line with IFN-γ, and the treated cells were recognised by C111 T cells, indicating 
that certain tumour types shared expression of the antigen recognised by these T cells (Table 3). 

Stable transfectants of the 293 cell line that expressed either the autologous MHC class II 
DRβ1*0402 or 1301 gene products were then mixed in equal numbers and transiently 
transfected with DNA pools generated from the autologous tumour cell cDNA library. The 
positive pool that was initially identified following the screening of approximately 3×10^4 clones, 
4G3, appeared to sensitise either 293-DRβ1*0402 or 1301 target cells for recognition by C111 T 
cells, and a single cDNA clone that could sensitise target cells for recognition by C111 T cells, 
1D8, was identified (Fig. 2). 

An assay carried out by transfection of the 293-DRβ1*0402+ or 1301+ cell lines 
individually with the 1D8 cDNA indicated that either of these HLA class II restriction elements 
could present the T cell epitope to C111 T cells. In contrast, 293 cell lines that expressed the 
HLA-DRβ1*0101, 0401, 0701 or 1601 class II alleles failed to stimulate these T cells following 
transfection of the 1D8 cDNA clone (data not shown), indicating that presentation of this epitope 
to C111 T cells may be limited to the two autologous HLA-DR alleles expressed by 1869 col 
cells. Further screening of the cDNA library resulted in the isolation of a second cDNA clone 
that was nearly identical to the 1D8 clone. The isolation of a second clone with a nearly 
identical sequence supports the finding that this represents the natural transcript encoding the 
antigen recognised by C111 T cells. 

Characterization of colorectal tumour associated antigen COA-1. 

The 1D8 insert contained a 44 bp polyA tail at the 3' end, but appeared to represent a 
partial cDNA clone as it was only 291 bp in length. The 5' end of the gene product that was 
expressed in the 1869 col cell line was then isolated by carrying out a rapid amplification of 
cDNA ends (RACE) reaction using nested internal primers complementary to the sequence of 
the 1D8 clone. Sequencing of products that were cloned from this reaction indicated that a 1412 
bp product represented the predominant transcript of the gene in the 1869 col cell line that 
encoded the antigen recognised by C111 T cells, which was designated colorectal antigen-1 
(COA-1) (Fig. 3). 

Comparison of the COA-1 sequence with the genomic DNA sequence database indicated 
that this product was derived from 13 exons, but at least two additional alternatively spliced 
products of this gene were isolated from the RACE reaction. An alignment of the COA-1 
transcript with the human EST database indicated that this was identical or nearly identical to 
several sequences obtained from normal human brain, placenta, ovary, and testis, as well as 
sequences obtained from a variety of adenocarcinomas. 

The 5' end of the transcript cloned from the RACE reaction corresponded to the 5' end of 
several EST sequences found in the database, and the 3' end of the original cDNA clone 
corresponded to the 3' end of the EST transcripts derived from several cell lines, indicating that 
these may represent the authentic 5' and 3' ends of the predominant COA-1 colon tumour cell 
transcript. The COA-1 sequence was also nearly identical to that of a transcript encoding the 
human homologue of the rat Socius protein, a molecule that was recently cloned on the basis of 
its ability to bind to a member of the Rnd family of GTPases (20). 

Forward and reverse primers located at or near the 5' and 3' ends of the putative COA-1 
gene product were then used to carry out an RT-PCR from 1869 RNA, as the RACE products 
that had been cloned only comprised a portion of the normal transcript. When RT-PCR was 
carried out with several primers that were proximal to the putative 5' end of the transcript in 
combination with primers that were complementary to the highly repetitive G/C rich sequence 
near the 3' end of the COA-1 transcript, a variety of non-specific transcripts were generated 
(data not shown). A product that was designated COA-1a was, however, successfully amplified 
from 1869 col RNA using two primers that encompassed the region between nucleotides 290 and 
1318 of the putative full length COA-1 transcript. 

Transfectants that co-expressed the COA-1a gene along with either HLA-DRβ1*0402 or 
1301 appeared to stimulate comparable levels of cytokine release from C111 T cells to those 
transfected with the truncated 1D8 cDNA clone, showing that the full length gene can be 
processed relatively efficiently (Fig. 2). Co-transfection of the COA-1a gene with a construct 
encoding the full length human invariant chain (Ii) had little or no effect on the recognition of 
target cells transfected with the COA-1a product by C111 T cells. Thus, either the levels of Ii 
expression in 293 cells that were also transfected with a construct encoding the CIITA gene 
product were adequate for recognition of this epitope, or Ii expression does not have a significant 
impact on the processing of the COA-1 epitope. 

In addition, the COA-1a product was not fused with amino acids one to 80 of the human 
Ii molecule, which had previously been shown to enhance the recognition of some HLA class II 
antigens (21). 

The observation that the fusion of the cDNA clone with the invariant chain did not 
enhance recognition by the CD4+ T cells shows that the COA-1 antigen may naturally target the 
endogenous HLA class II processing pathway in colon tumour cells. 

The expression pattern of the COA-1 gene was then examined in several colorectal, 
melanoma, and EBV-B cell lines, as well as in several normal cell lines which included CD40L 
stimulated B cell and fibroblast cell lines. The results of Northern blot analysis indicated that 
this gene was expressed at relatively low levels in colon and melanoma tumour cell lines, EBV B 
cells, normal B cells and fibroblasts, and quantitative TaqMan RT-PCR indicated that the levels 
of expression did not differ significantly between these cells (data not shown). 

The observation that the level of expression of the COA-1 gene did not differ 
significantly between cell lines that were or were not recognised by C111 T cells showed that 
these cells express similar but non-identical products. Therefore, transcripts of the COA-1 gene 
that were expressed in the autologous and allogeneic CD40L stimulated B cells, as well as in 
allogeneic fibroblast cell lines, were isolated using RT-PCR and sequenced. 

The results of sequencing carried out with the bulk RT-PCR products showed that 
CD40L stimulated B cells and fibroblast cell lines predominantly expressed products that 
appeared to be identical to the COA-1 transcript derived from 1869 col cells with the exception 
of a single substitution of a T for a C residue at nucleotide position 1280, resulting in a change at 
amino acid 399. 

The COA-1 transcripts that were expressed in CD40L B cells were isolated by carrying 
out RT-PCR and cloning the resultant products. Ten out of ten clones from the CD40L B cells 
that were sequenced contained a T at position 1280 but were otherwise identical to the 1869 col 
COA-1 transcript. 

Amplification of the COA-1 gene product from the allogeneic colorectal tumour lines 
SW1463, SW480 and 1847 col, as well as the 1681 mel line, showed that these cells 
predominantly expressed products containing a C residue at position 1280, as determined by 
sequencing the bulk, un-cloned RT-PCR products that were amplified from these cells (data not 
shown). Two peaks of comparable heights that corresponded to C and T residues at position 
1280 of the COA-1 transcript were obtained by sequencing the un-cloned RT-PCR product from 
autologous EBV B cells, indicating that these products may be expressed at similar levels in 
these cells. The results obtained using RNA from autologous CD40L stimulated B cells, EBV B 
cells, and the colon tumour cell lines were confirmed by repeated analysis carried out on 
products obtained from four independent RT-PCR reactions, showing that the residue found at 
nucleotide 1280 of the COA-1 transcripts did not represent a PCR mutation (data not shown). 

To evaluate the significance of the single base pair change at position 1280 in the COA-1a 
sequence, the RT-PCR products obtained from autologous CD40L stimulated B cells were 
cloned in a eukaryotic expression vector. A plasmid containing the COA-1a transcript that was 
amplified from the normal B cells was then compared with products cloned from 1869 col cells 
for its ability to sensitise 293-DR*0402 or 293-DR*1301 cells for recognition by C111 T cells. 
Target cells expressing either of the autologous HLA-DR genes that were transfected with the 
COA-1a or 1D8 gene products, but not the product that was isolated from CD40L activated B 
cells, stimulated cytokine release from C111 T cells (Fig. 4). These results showed that there 
was a correlation between the recognition of normal B cells and tumour cells and the ability of 
the COA-1 gene products that were expressed by these cells to sensitise targets for recognition 
by C111 T cells. 



Identification of the epitope recognised by the CD4+ clone C111. 

The results of transfection studies carried out using truncated COA-1 gene products 
showed that the C111 T cell epitope was encoded by a region located between nucleotides 1121 
and 1288 of the COA-1 transcript. The longest open reading frame in the COA-1 transcript, 
which overlapped with the Socius gene product (20), was utilised as the basis for the synthesis of 
peptides that were used to identify the T cell epitope recognised by C111 T cells. 

Peptides that were 20 or 21 amino acids in length and that overlapped by either 14 or 16 
amino acids were then synthesised and tested for their ability to sensitise target cells for 
recognition by C111 T cells. Since autologous normal B cells could not be efficiently expanded, 
allogeneic normal B cells expressing either DRβ1*0402 or DRβ1*1301 were used to carry out 
these assays. 

The 1681 and 1847 CD40L stimulated normal B cell lines shared expression of 
HLA-DRβ1*0402 and HLA-DRβ1*1301 molecules, respectively, with the autologous tumour cell 
line. These cells were incubated with the panel of peptides and then tested for their ability to 
stimulate cytokine release from C111 T cells. The results showed that 1681 and 1847 CD40L B 
cells that were pulsed with either of the two overlapping peptides 
FSTFPPTLYQDDTLTLQAAG (SEQ ID NO 17) and TLYQDDTLTLQAAGLVPKAA (SEQ 
ID NO 18) stimulated significant cytokine release from C111 T cells. 

These T cells thus recognise the peptide TLYQDDTLTLQAAG (SEQ ID NO 6), which 
represents the overlapping region in these peptides. The L at position 2, the T at position 7 
and the L at position 10 in this sequence conform to an HLA binding motif that has been identified 
for the HLA-DRβ1*0402 class II allele (22). However, it was not possible to identify the 
potential anchor residues in this sequence that were involved in binding to the 
HLA-DRβ1*1301 allele. Nevertheless, these observations show that C111 T cells recognise a single 
peptide epitope in the context of either the HLA-DRβ1*0402 or 1301 class II gene products. 
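
The step from the two stimulatory peptides to the shared 14-residue sequence can be checked mechanically. The sketch below uses the two peptide sequences given in the text; the function name is hypothetical and the approach (longest suffix of the first peptide that is also a prefix of the second) is simply one way to compute the overlap. It also confirms the residues at positions 2, 7 and 10 that are mentioned in connection with the HLA-DRβ1*0402 binding motif.

```python
def suffix_prefix_overlap(first: str, second: str) -> str:
    """Return the longest suffix of `first` that is also a prefix of
    `second` (an empty string if the peptides do not overlap)."""
    for size in range(min(len(first), len(second)), 0, -1):
        if first.endswith(second[:size]):
            return second[:size]
    return ""


# Overlapping peptides from the text (SEQ ID NO 17 and SEQ ID NO 18).
PEPTIDE_17 = "FSTFPPTLYQDDTLTLQAAG"
PEPTIDE_18 = "TLYQDDTLTLQAAGLVPKAA"

if __name__ == "__main__":
    shared = suffix_prefix_overlap(PEPTIDE_17, PEPTIDE_18)
    print(f"Shared region: {shared} ({len(shared)} residues)")
    # Positions are 1-based in the text, so subtract one for indexing.
    for position in (2, 7, 10):
        print(f"Position {position}: {shared[position - 1]}")
```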

        A          B            C             DRβ1          DRβ3-5          DQ
1869    3,24       35,38        0401,1203     0402,1301     3*01,4*01       03,06
1870    24         35           04            1202          3*03            03
1872    02,03      07,4402      0501,0702     0401,1501     4*01,5*01       03,06
1681    01,0201    08,44        N.D.          0301,0402     3*0101,4*01     0301,0402
1847    02         18,44        05,0701       0401,1301     3*01,4*01       03,06
1519    24,32      1401,4402    05,08         0701,1301     3*01,4*01       02,06

Table 2 Specific recognition of colon cancer lines by CD4+ clones from patient 1869. 

Target cells            Antibody (a)    HLA-DRB1         T cell (b)
                                                         C4       C49      C111
None                    None                             <8       <8       <5
1869 col + IFN-γ (d)    None            *0402, *1301     234      1213     536
                        W6/32                            212      1100     442
                        L243                             107      97       17
1869 col CIITA          None                             536      5178     5005
                        W6/32                            527      4987     4249
                        L243                             47       254      305
1870 col + IFN-γ        None            *1202            <8       <8       <8
                        W6/32                            <8       <8       <8
                        L243                             <8       <7.8     <8
1872 col + IFN-γ        None            *0401, *1501     <8       <8       <8
                        W6/32                            <8       <8       <8
                        L243                             <8       <8       <8
SW480 CIITA             None            *0103, *1301     879      968      963
                        W6/32                            780      902      996
                        L243                             571      129      127
Colo 205 CIITA          None            *0401, *1301     68       942      686
                        W6/32                            76       951      669
                        L243                             78       170      489
1869 EBV-B                              *0402, *1301     52       126      322

a. Target cells were pre-incubated for 1 hour with either the anti-MHC class I mAb W6/32 or the 
anti-HLA DR mAb L243 before addition to T cells. 

b. 2×10^4 T cells were incubated with 5×10^4 target cells in flat bottom 96-well plates in 250 µl of 
AIM-V plus 2% HS. After 18 hrs, IFN-γ secretion in the supernatants was evaluated by ELISA. 

c. pg/ml of IFN-γ. 

d. Where indicated, target cells were pre-incubated for 48 hrs with 500 IU of IFN-γ. 
Table 3 CD4 clones recognised colon cancer lines but not normal B or fibroblast cells sharing
MHC class II molecules

A

Stimulator            Antibody a  HLA-DRβ1   T cell b
                                             TIL      PBL      C5       C15      C111
None                  None                   <8 c     <8       <8       <8       <8
1869 col CIITA        None        *0402,     8695     1259     12328    12749    15269
                      L243                   279      162      511      524      790
1847 col + IFN-γ d    None        *0401,     2008     457      598      9758     11576
                      L243                   2055     327      585      790      2938
1872 col + IFN-γ      None        *0401,     72       <8       61       <8       66
                      L243                   75       <8       60       <8       41
1869EBV-B             None        *0402,     79       116      122      232      209
1519EBV-B             None        *0701,     112      24       99       106      220
1519 Fibroblast +     None        *0701,     <8       <8       <8       55       62
1869CD40LB e          None        *0402,     <8       <8       <8       <8       45


B

Stimulator            Antibody a  T cell b
                                  C111
None                              23
1869 col CIITA                    15269
1869 col CIITA        HLA-DR      790
1681 mel + IFN-γ                  10298
1681 mel + IFN-γ      HLA-DR      253
1869 B cells                      65
1681 B cells                      22


a. Where indicated, target cells were pre-incubated for 1 hour with the anti-HLA DR mAb L243. 

b. 2x10^4 of the indicated T cells were incubated with 5x10^4 target cells in flat bottom 96-well plates in
µl of AIMV 2% HS. After 18 hrs. the IFN-γ secretion in the supernatants was evaluated by ELISA.

c. pg/ml of IFN-γ.

d. Where indicated, target cells were pre-incubated for 48 hrs with 500 IU of IFN-γ.

e. B cells from patient 1869 were cultured in vitro with CD40L (100 KJ/ml) and IL-4 (100 IU/ml).
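
The blocking effect of the anti-HLA DR mAb L243 in part B of Table 3 can be expressed as a percentage drop in IFN-γ release. The following is only an illustrative sketch: the function name `percent_inhibition` and the choice of percentage inhibition as a summary measure are assumptions made here and are not part of the described method. The input values are the pg/ml readings for clone C111 taken from part B of the table.

```python
def percent_inhibition(uninhibited, inhibited):
    """Percentage drop in cytokine release caused by a blocking antibody.

    Hypothetical helper for illustration; not part of the original assay.
    """
    if uninhibited <= 0:
        raise ValueError("uninhibited release must be positive")
    return 100.0 * (uninhibited - inhibited) / uninhibited


# IFN-gamma release (pg/ml) by clone C111, read from part B of Table 3:
# (stimulator, without antibody, with anti-HLA DR mAb L243)
measurements = [
    ("1869 col CIITA", 15269, 790),
    ("1681 mel + IFN-gamma", 10298, 253),
]

for name, free, blocked in measurements:
    print(f"{name}: {percent_inhibition(free, blocked):.1f}% inhibition")
```

Values reported as "<8" are below the detection limit and would need a convention of their own before being used in such a calculation.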
Table 4 Identification of the COA-1-derived epitopes recognised by the CD4+ clone C111.

Stimulator     HLA-DRβ1      No peptide a  Peptide b               Peptide conc. (µg/ml)
                                                                   100     50      25      12.5    6.25
None                         <8
1869 col       0402, 1301    2186
1681 CD40LB    0301, 0402    <8
1847 CD40LB    0401, 1301    <8
1681 CD40LB    0301, 0402                  FSTFPPTLYQDDTLTLQAAG    105     236     69      <7.8    <7.8
1681 CD40LB    0301, 0402                  TLYQDDTLTLQAAGLVPKAA    51      159     <7.8    <7.8    <7.8
1681 CD40LB    0301, 0402                  DDTLTLQAAGLVPKAALLLRA   11      16      <7.8    <7.8    <7.8
1681 CD40LB    0301, 0402                  LQAAGLVPKAALLLRARRAP    21      12      <7.8    <7.8    <7.8
1847 CD40LB    0401, 1301                  ASAFEIFSTFPPTLYQDDTL    <7.8    <7.8    <7.8    <7.8    <7.8
1847 CD40LB    0401, 1301                  FSTFPPTLYQDDTLTLQAAG    226     397     296     79      <7.8
1847 CD40LB    0401, 1301                  TLYQDDTLTLQAAGLVPKAA    79      326     <7.8    <7.8    <7.8
1847 CD40LB    0401, 1301                  DDTLTLQAAGLVPKAALLLRA   22      33      <7.8    <7.8    <7.8
1847 CD40LB    0401, 1301                  LQAAGLVPKAALLLRARRAP    52      32      <7.8    <7.8    <7.8


a. The CD4+ T cell clone C111 was then added at 2x10^4 cells/well at the final volume of 250 µl/well
of ISCOVE's plus 10% HS and after 18 hrs. of incubation the supernatants were collected and the
IFN-γ release was evaluated by ELISA.

b. Peptides of 20 or 21 amino acids overlapping by 15 amino acids were synthesised using the
putative COA-1 protein, in the 1D8 region (1012-1318 bp). 4x10^5/ml of B cells sharing one of the
DRβ1 molecules (*0402 or *1301) with the autologous tumour 1869, were incubated for three hrs. at
37°C and 5% CO2 in the presence or not (-) of the peptides at the final volume of 100 µl/well in
ISCOVE's plus 10% HS.
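
The peptide design described in footnote b of Table 4 (overlapping peptides tiled across a protein region) can be sketched as a simple sliding window. This is a minimal illustration under stated assumptions, not the synthesis scheme actually used: it assumes a fixed 20-residue window and a fixed 15-residue overlap, whereas the peptides listed in Table 4 are 20 or 21 residues long and are not spaced uniformly. The function name `overlapping_peptides` is hypothetical, and the region string is residues 360 to 400 of SEQ ID NO 2.

```python
def overlapping_peptides(sequence, length=20, overlap=15):
    """Tile a protein sequence with fixed-length peptides sharing `overlap` residues.

    Hypothetical helper for illustration; window and overlap are assumptions.
    """
    if not 0 <= overlap < length:
        raise ValueError("overlap must be non-negative and smaller than length")
    step = length - overlap
    return [
        sequence[start:start + length]
        for start in range(0, len(sequence) - length + 1, step)
    ]


# Residues 360-400 of SEQ ID NO 2 (one-letter code)
region = "ASAFEIFSTFPPTLYQDDTLTLQAAGLVPKAALLLRARRAP"

for peptide in overlapping_peptides(region):
    print(peptide)
```

With these assumed settings the first window coincides with the peptide ASAFEIFSTFPPTLYQDDTL of Table 4; later windows differ from the listed peptides because the original set was not generated with a uniform step.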
Claims

1 The use of a peptide comprising all or an immunogenic part of the amino acid sequence 
designated SEQ ID NO 6 in the manufacture of a vaccine to stimulate an immune response 
against COA-1 (SEQ ID NO 2). 

2 Use according to claim 1, wherein the immunogenic part of the sequence comprises 8 or 
more contiguous amino acid residues of SEQ ID NO 6. 

3 Use according to claim 2, wherein the immunogenic part of the sequence comprises 10 or
more contiguous amino acid residues of SEQ ID NO 6.

4 Use according to any preceding claim, wherein the immunogenic part of the sequence
comprises SEQ ID NO. 9 at the N-terminus and/or SEQ ID NO. 10 at the C-terminus.

5 Use according to claim 1, wherein the immunogenic part of the sequence consists of SEQ
ID NO 6.

6 Use according to any preceding claim, wherein the immune response is stimulated 
against Colorectal Cancer cells. 

7 Use according to any preceding claim, wherein the peptide is an oligopeptide. 

8 Use according to any preceding claim, wherein the immunogenic part of the sequence is 
processed and expressed by antigen presenting cells in association with sympathetic MHC class 
II molecules. 

9 Use according to claim 8, wherein the MHC class II molecules are the HLA DRβ1*0402
and/or HLA DRβ1*1301 alleles.

10 Use according to any preceding claim, wherein the vaccine further comprises PBMC's
(Peripheral Blood Mononuclear Cells) either expressing the HLA DRβ1*0402 and/or the HLA
DRβ1*1301 alleles.

11 A peptide comprising the amino acid sequence designated SEQ ID NO 6, wherein an
immunogenic response is raised by administration thereof.

12 A peptide as defined in any of claims 2-5. 



WO 2005/039632 PCT/EP2004/012087 


13 A peptide consisting of a peptide as defined in any of claims 2-5.

14 A polynucleotide encoding a peptide, as defined in any preceding claim.

15 A vaccine comprising a peptide, as defined in any preceding claim.

16 A vaccine according to claim 15 comprising a suitable carrier.

17 A vaccine according to any of claims 15-16, comprising the peptide and PBMC's
expressing a sympathetic MHC Class II allele therefor.

18 A vaccine according to claim 17, wherein the MHC Class II allele is the HLA
DRβ1*0402 and/or the HLA DRβ1*1301 allele.

19 A host cell expressing a peptide as defined in any of claims 1-16. 

20 An antibody raised against a peptide as defined in any of claims 1-16. 

21 A method for stimulating immunity against colorectal cancer, comprising stimulating the 
production of antibodies against a peptide, as defined in any of claims 1-16. 

22. A method according to claim 21, wherein immunity is stimulated in the patient in
conjunction with PBMC's allogeneic or autologous for at least one sympathetic HLA-II allele 
capable of presenting all or an immunogenic part of the amino acid sequence designated SEQ ID 
NO 6 in an immunogenic manner. 

23. A method according to claim 22, wherein the allele is selected from HLA DRβ1*0402
and HLA DRβ1*1301.

24. A method according to any of claims 21-23, wherein the patient has PBMC's autologous 
or allogeneic for at least one sympathetic HLA-II allele capable of presenting the COA-1 epitope 
in an immunogenic manner, the method comprising administering a vaccine comprising the 
immunising portion of COA-1, or a precursor therefor, as defined in any preceding claim, to the 
patient. 

25. A method for stimulating immunity to colorectal cancer in a patient, said method
comprising 

i) isolating PBMC's or their progenitors from the patient and transforming said cells with at 
least one sympathetic HLA-II allele capable of presenting the COA-1 epitope in an immunogenic 
manner, 

ii) introducing the transformed PBMC's back into the patient, and 
iii) administering a vaccine comprising the immunising portion of COA-1, or a precursor
therefor, as defined in any of claims 1 to 16, to the patient.

26. A method according to claim 25, wherein the immunising portion of COA-1, or a
precursor therefor, is administered with the transformed PBMC's.

27. A passive vaccine comprising an antibody according to claim 20.

28. A diagnostic assay for colorectal cancer comprising an antibody according to claim 20.

29 Use according to any of claims 1-4, wherein the immune response is stimulated against
melanoma cells.
MAFMTRKLWD LEQQVKAQTD EILSKDQKIA ALEDLVQTLR PHPAEATLQR QEELETMCVQ 60
LQRQVREMER FLSDYGLQWV GEPMDQEDSE SKTVSEHGER DWMTAKKFWK PGDSLAPPEV 120
DFDRLLASLQ DLSELVVEGD TQVTPVPGGA RLRTLEPIPL KLYRNGIMMF DGPFQPFYDP 180
STQRCLRDIL DGFFPSELQR LYPNGVPFKV SDLRNQVYLE DGLDPFPGEG RVVGRQRMHK 240
ALDRVEEHPG SRMTAEKFLN RLPKFVIRQG EVIDIRGPIR DTLQNCCPLP ARIQEIVVET 300
PTLAAERERS QESPNTPAPP LSMLRIKSEN GEQAFLLMMQ PDNTIGDVRA LLAQARVMDA 360
SAFEIFSTFP PTLYQDDTLT LQAAGLVPKA ALLLRARRAP KSSLKFSPGP CPGPGPGPSP 420
GPGPGSSPCP GPSPSPQ 437

Alanine at position 399
cgctgcggga cggctagcgg ccctgcgtgg aggcgaggaa tccgcatcta tggagatgtc 60

cctgcatccc atgactcgga gctg atg gcc ttc atg acg agg aag ttg tgg 111
                           Met Ala Phe Met Thr Arg Lys Leu Trp
                           1               5

gac ctg gag cag cag gtg aag gcc cag act gat gag ata ctg tcc aag 159
Asp Leu Glu Gln Gln Val Lys Ala Gln Thr Asp Glu Ile Leu Ser Lys
10                  15                  20                  25

gat cag aag ata gcg gcc cta gag gac ctg gtg cag acc ctc cgg cca 207
Asp Gln Lys Ile Ala Ala Leu Glu Asp Leu Val Gln Thr Leu Arg Pro
                30                  35                  40

cac cca gcc gag gca acc ctg cag cgg cag gag gaa ctg gag acg atg 255
His Pro Ala Glu Ala Thr Leu Gln Arg Gln Glu Glu Leu Glu Thr Met
            45                  50                  55

tgt gtg cag ctg cag cgg cag gtc agg gag atg gag cgg ttc ctc agt 303
Cys Val Gln Leu Gln Arg Gln Val Arg Glu Met Glu Arg Phe Leu Ser
        60                  65                  70

gac tat ggc ctg cag tgg gtg ggc gag ccc atg gac cag gag gac tca 351
Asp Tyr Gly Leu Gln Trp Val Gly Glu Pro Met Asp Gln Glu Asp Ser
    75                  80                  85

gag agc aag aca gtc tca gag cat ggc gag agg gac tgg atg aca gcc 399
Glu Ser Lys Thr Val Ser Glu His Gly Glu Arg Asp Trp Met Thr Ala
90                  95                  100                 105

aag aag ttc tgg aag cca ggg gac tca ttg gcg ccc cct gag gtg gac 447
Lys Lys Phe Trp Lys Pro Gly Asp Ser Leu Ala Pro Pro Glu Val Asp
                110                 115                 120

ttt gac agg ctg ctg gcc agc ctg cag gat ctt agt gag ctg gtg gta 495
Phe Asp Arg Leu Leu Ala Ser Leu Gln Asp Leu Ser Glu Leu Val Val
            125                 130                 135

gag ggt gac acc caa gtg aca cca gtg ccc ggc ggg gca cgg ctg cgt 543
Glu Gly Asp Thr Gln Val Thr Pro Val Pro Gly Gly Ala Arg Leu Arg
        140                 145                 150

acc ctc gag ccc atc ccg ctg aag ctc tac cgg aat ggc atc atg atg 591
Thr Leu Glu Pro Ile Pro Leu Lys Leu Tyr Arg Asn Gly Ile Met Met
    155                 160                 165

ttc gac ggg ccc ttc cag ccc ttc tac gat ccc tcc aca cag cgc tgc 639
Phe Asp Gly Pro Phe Gln Pro Phe Tyr Asp Pro Ser Thr Gln Arg Cys
170                 175                 180                 185

ctc cga gac ata ttg gat ggc ttc ttt ccc tca gag ctc cag cga ctg 687
Leu Arg Asp Ile Leu Asp Gly Phe Phe Pro Ser Glu Leu Gln Arg Leu
                190                 195                 200

tac ccc aat ggg gtc ccc ttt aag gtg agt gac ttg cgc aat cag gtc 735
Tyr Pro Asn Gly Val Pro Phe Lys Val Ser Asp Leu Arg Asn Gln Val
            205                 210                 215

tac ctg gag gat gga ctg gac ccc ttc cca ggc gag ggc cgt gtg gtg 783
Tyr Leu Glu Asp Gly Leu Asp Pro Phe Pro Gly Glu Gly Arg Val Val
        220                 225                 230

ggc agg cag cgg atg cac aag gcc ttg gac agg gtg gag gag cac cca 831
Gly Arg Gln Arg Met His Lys Ala Leu Asp Arg Val Glu Glu His Pro
    235                 240                 245

ggc tcc agg atg act gct gag aaa ttt ctg aac agg ctc ccc aag ttt 879
Gly Ser Arg Met Thr Ala Glu Lys Phe Leu Asn Arg Leu Pro Lys Phe
250                 255                 260                 265

gtg atc cgg caa ggc gag gtg att gac atc cgg ggc ccc atc agg gac 927
Val Ile Arg Gln Gly Glu Val Ile Asp Ile Arg Gly Pro Ile Arg Asp
                270                 275                 280

acc ttg cag aac tgc tgc cca ttg cct gcc cgg atc cag gag att gtg 975
Thr Leu Gln Asn Cys Cys Pro Leu Pro Ala Arg Ile Gln Glu Ile Val
            285                 290                 295

gtg gag acg ccc acc ttg gcc gct gag cga gag agg agc cag gag tca 1023
Val Glu Thr Pro Thr Leu Ala Ala Glu Arg Glu Arg Ser Gln Glu Ser
        300                 305                 310

ccc aac aca ccg gca ccc ccg ctc tcc atg ctg cgc atc aag tct gag 1071
Pro Asn Thr Pro Ala Pro Pro Leu Ser Met Leu Arg Ile Lys Ser Glu
    315                 320                 325

aat ggg gaa cag gcc ttc cta ctg atg atg cag cct gac aac acc att 1119
Asn Gly Glu Gln Ala Phe Leu Leu Met Met Gln Pro Asp Asn Thr Ile
330                 335                 340                 345

ggg gac gtg cga gct ctg cta gcg cag gcc agg gtc atg gat gcc tct 1167
Gly Asp Val Arg Ala Leu Leu Ala Gln Ala Arg Val Met Asp Ala Ser
                350                 355                 360

gcc ttt gag atc ttc agc aca ttc ccg ccc acc ctc tac cag gac gat 1215
Ala Phe Glu Ile Phe Ser Thr Phe Pro Pro Thr Leu Tyr Gln Asp Asp
            365                 370                 375

aca ctc acg ctg cag gct gca ggc ctt gtg ccc aaa gca gca ctg ctg 1263
Thr Leu Thr Leu Gln Ala Ala Gly Leu Val Pro Lys Ala Ala Leu Leu
        380                 385                 390

Cytosine at position 1280

ctg cgg gca cgc cga gCc ccg aag tcc agc ctg aaa ttc agt cct ggt 1311
Leu Arg Ala Arg Arg Ala Pro Lys Ser Ser Leu Lys Phe Ser Pro Gly
    395                 400                 405

Alanine at position 399

ccc tgt ccc ggt ccc ggt ccc ggc ccc agt ccc ggt ccc ggt ccc ggc 1359
Pro Cys Pro Gly Pro Gly Pro Gly Pro Ser Pro Gly Pro Gly Pro Gly
410                 415                 420                 425

tcc agt ccc tgt ccc gga ccc agt ccc agc ccc caa taaagcaccc 1405
Ser Ser Pro Cys Pro Gly Pro Ser Pro Ser Pro Gln
                430                 435         *

accccctc 1413

FIG. 5
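
The two annotations in FIG. 5 ("Cytosine at position 1280" and "Alanine at position 399") can be cross-checked with simple arithmetic, since a nucleotide position maps to a codon once the start of the coding sequence is fixed. The sketch below assumes that the coding sequence runs from nucleotide 85 (the atg that follows 84 untranslated nucleotides in the figure) to nucleotide 1395 (the last base of the final caa codon). The function name `codon_position` is hypothetical and is introduced only for this illustration.

```python
CDS_START = 85    # first nucleotide of the atg start codon in FIG. 5 (assumed from the figure numbering)
CDS_END = 1395    # last nucleotide of the final sense codon (caa, Gln)


def codon_position(nt_pos, cds_start=CDS_START):
    """Map a 1-based nucleotide position to (residue number, base within codon)."""
    offset = nt_pos - cds_start
    if offset < 0:
        raise ValueError("position lies upstream of the coding sequence")
    return offset // 3 + 1, offset % 3 + 1


residue, base = codon_position(1280)
print(residue, base)

# Number of sense codons spanned by the coding sequence
n_codons = (CDS_END - CDS_START + 1) // 3
print(n_codons)
```

Under these assumptions, position 1280 falls on the second base of codon 399, which agrees with the capitalised base in the "gCc" codon and with the alanine label, and the coding sequence spans 437 codons, matching the length of the translated protein.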
SEQUENCE LISTING

<110> Istituto Superiore di Sanita 
National Institutes of Health 

<120> COLORECTAL ANTIGEN 

<130> WPP88367 

<150> US 60/512,040 

<151> 2003-10-15 

<160> 20 

<170> PatentIn version 3.3

<210> 1 

<211> 1413 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (85)..(1395)

<220> 

<221> misc_feature
<222> (1130)..(1240)

<223> nucleotide sequence encoding the immunogenic peptide 
<400> 1 

cgctgcggga cggctagcgg ccctgcgtgg aggcgaggaa tccgcatcta tggagatgtc 60

cctgcatccc atgactcgga gctg atg gcc ttc atg acg agg aag ttg tgg 111
                           Met Ala Phe Met Thr Arg Lys Leu Trp
                           1               5

gac ctg gag cag cag gtg aag gcc cag act gat gag ata ctg tcc aag 159
Asp Leu Glu Gln Gln Val Lys Ala Gln Thr Asp Glu Ile Leu Ser Lys
10                  15                  20                  25

gat cag aag ata gcg gcc cta gag gac ctg gtg cag acc ctc cgg cca 207
Asp Gln Lys Ile Ala Ala Leu Glu Asp Leu Val Gln Thr Leu Arg Pro
                30                  35                  40

cac cca gcc gag gca acc ctg cag cgg cag gag gaa ctg gag acg atg 255
His Pro Ala Glu Ala Thr Leu Gln Arg Gln Glu Glu Leu Glu Thr Met
            45                  50                  55

tgt gtg cag ctg cag cgg cag gtc agg gag atg gag cgg ttc ctc agt 303
Cys Val Gln Leu Gln Arg Gln Val Arg Glu Met Glu Arg Phe Leu Ser
        60                  65                  70

gac tat ggc ctg cag tgg gtg ggc gag ccc atg gac cag gag gac tca 351
Asp Tyr Gly Leu Gln Trp Val Gly Glu Pro Met Asp Gln Glu Asp Ser
    75                  80                  85

gag agc aag aca gtc tca gag cat ggc gag agg gac tgg atg aca gcc 399
Glu Ser Lys Thr Val Ser Glu His Gly Glu Arg Asp Trp Met Thr Ala
90                  95                  100                 105

aag aag ttc tgg aag cca ggg gac tca ttg gcg ccc cct gag gtg gac 447
Lys Lys Phe Trp Lys Pro Gly Asp Ser Leu Ala Pro Pro Glu Val Asp
                110                 115                 120

ttt gac agg ctg ctg gcc agc ctg cag gat ctt agt gag ctg gtg gta 495
Phe Asp Arg Leu Leu Ala Ser Leu Gln Asp Leu Ser Glu Leu Val Val
            125                 130                 135

gag ggt gac acc caa gtg aca cca gtg ccc ggc ggg gca cgg ctg cgt 543
Glu Gly Asp Thr Gln Val Thr Pro Val Pro Gly Gly Ala Arg Leu Arg
        140                 145                 150

acc ctc gag ccc atc ccg ctg aag ctc tac cgg aat ggc atc atg atg 591
Thr Leu Glu Pro Ile Pro Leu Lys Leu Tyr Arg Asn Gly Ile Met Met
    155                 160                 165

ttc gac ggg ccc ttc cag ccc ttc tac gat ccc tcc aca cag cgc tgc 639
Phe Asp Gly Pro Phe Gln Pro Phe Tyr Asp Pro Ser Thr Gln Arg Cys
170                 175                 180                 185

ctc cga gac ata ttg gat ggc ttc ttt ccc tca gag ctc cag cga ctg 687
Leu Arg Asp Ile Leu Asp Gly Phe Phe Pro Ser Glu Leu Gln Arg Leu
                190                 195                 200

tac ccc aat ggg gtc ccc ttt aag gtg agt gac ttg cgc aat cag gtc 735
Tyr Pro Asn Gly Val Pro Phe Lys Val Ser Asp Leu Arg Asn Gln Val
            205                 210                 215

tac ctg gag gat gga ctg gac ccc ttc cca ggc gag ggc cgt gtg gtg 783
Tyr Leu Glu Asp Gly Leu Asp Pro Phe Pro Gly Glu Gly Arg Val Val
        220                 225                 230

ggc agg cag cgg atg cac aag gcc ttg gac agg gtg gag gag cac cca 831
Gly Arg Gln Arg Met His Lys Ala Leu Asp Arg Val Glu Glu His Pro
    235                 240                 245

ggc tcc agg atg act gct gag aaa ttt ctg aac agg ctc ccc aag ttt 879
Gly Ser Arg Met Thr Ala Glu Lys Phe Leu Asn Arg Leu Pro Lys Phe
250                 255                 260                 265

gtg atc cgg caa ggc gag gtg att gac atc cgg ggc ccc atc agg gac 927
Val Ile Arg Gln Gly Glu Val Ile Asp Ile Arg Gly Pro Ile Arg Asp
                270                 275                 280

acc ttg cag aac tgc tgc cca ttg cct gcc cgg atc cag gag att gtg 975
Thr Leu Gln Asn Cys Cys Pro Leu Pro Ala Arg Ile Gln Glu Ile Val
            285                 290                 295

gtg gag acg ccc acc ttg gcc gct gag cga gag agg agc cag gag tca 1023
Val Glu Thr Pro Thr Leu Ala Ala Glu Arg Glu Arg Ser Gln Glu Ser
        300                 305                 310

ccc aac aca ccg gca ccc ccg ctc tcc atg ctg cgc atc aag tct gag 1071
Pro Asn Thr Pro Ala Pro Pro Leu Ser Met Leu Arg Ile Lys Ser Glu
    315                 320                 325

aat ggg gaa cag gcc ttc cta ctg atg atg cag cct gac aac acc att 1119
Asn Gly Glu Gln Ala Phe Leu Leu Met Met Gln Pro Asp Asn Thr Ile
330                 335                 340                 345

ggg gac gtg cga gct ctg cta gcg cag gcc agg gtc atg gat gcc tct 1167
Gly Asp Val Arg Ala Leu Leu Ala Gln Ala Arg Val Met Asp Ala Ser
                350                 355                 360

gcc ttt gag atc ttc agc aca ttc ccg ccc acc ctc tac cag gac gat 1215
Ala Phe Glu Ile Phe Ser Thr Phe Pro Pro Thr Leu Tyr Gln Asp Asp
            365                 370                 375

aca ctc acg ctg cag gct gca ggc ctt gtg ccc aaa gca gca ctg ctg 1263
Thr Leu Thr Leu Gln Ala Ala Gly Leu Val Pro Lys Ala Ala Leu Leu
        380                 385                 390

ctg cgg gca cgc cga gcc ccg aag tcc agc ctg aaa ttc agt cct ggt 1311
Leu Arg Ala Arg Arg Ala Pro Lys Ser Ser Leu Lys Phe Ser Pro Gly
    395                 400                 405

ccc tgt ccc ggt ccc ggt ccc ggc ccc agt ccc ggt ccc ggt ccc ggc 1359
Pro Cys Pro Gly Pro Gly Pro Gly Pro Ser Pro Gly Pro Gly Pro Gly
410                 415                 420                 425

tcc agt ccc tgt ccc gga ccc agt ccc agc ccc caa taaagcaccc 1405
Ser Ser Pro Cys Pro Gly Pro Ser Pro Ser Pro Gln
                430                 435

accccctc 1413



<210> 2 

<211> 437 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Ala Phe Met Thr Arg Lys Leu Trp Asp Leu Glu Gln Gln Val Lys
1               5                   10                  15

Ala Gln Thr Asp Glu Ile Leu Ser Lys Asp Gln Lys Ile Ala Ala Leu
            20                  25                  30

Glu Asp Leu Val Gln Thr Leu Arg Pro His Pro Ala Glu Ala Thr Leu
        35                  40                  45

Gln Arg Gln Glu Glu Leu Glu Thr Met Cys Val Gln Leu Gln Arg Gln
    50                  55                  60

Val Arg Glu Met Glu Arg Phe Leu Ser Asp Tyr Gly Leu Gln Trp Val
65                  70                  75                  80

Gly Glu Pro Met Asp Gln Glu Asp Ser Glu Ser Lys Thr Val Ser Glu
                85                  90                  95

His Gly Glu Arg Asp Trp Met Thr Ala Lys Lys Phe Trp Lys Pro Gly
            100                 105                 110

Asp Ser Leu Ala Pro Pro Glu Val Asp Phe Asp Arg Leu Leu Ala Ser
        115                 120                 125

Leu Gln Asp Leu Ser Glu Leu Val Val Glu Gly Asp Thr Gln Val Thr
    130                 135                 140

Pro Val Pro Gly Gly Ala Arg Leu Arg Thr Leu Glu Pro Ile Pro Leu
145                 150                 155                 160

Lys Leu Tyr Arg Asn Gly Ile Met Met Phe Asp Gly Pro Phe Gln Pro
                165                 170                 175

Phe Tyr Asp Pro Ser Thr Gln Arg Cys Leu Arg Asp Ile Leu Asp Gly
            180                 185                 190

Phe Phe Pro Ser Glu Leu Gln Arg Leu Tyr Pro Asn Gly Val Pro Phe
        195                 200                 205

Lys Val Ser Asp Leu Arg Asn Gln Val Tyr Leu Glu Asp Gly Leu Asp
    210                 215                 220

Pro Phe Pro Gly Glu Gly Arg Val Val Gly Arg Gln Arg Met His Lys
225                 230                 235                 240

Ala Leu Asp Arg Val Glu Glu His Pro Gly Ser Arg Met Thr Ala Glu
                245                 250                 255

Lys Phe Leu Asn Arg Leu Pro Lys Phe Val Ile Arg Gln Gly Glu Val
            260                 265                 270

Ile Asp Ile Arg Gly Pro Ile Arg Asp Thr Leu Gln Asn Cys Cys Pro
        275                 280                 285

Leu Pro Ala Arg Ile Gln Glu Ile Val Val Glu Thr Pro Thr Leu Ala
    290                 295                 300

Ala Glu Arg Glu Arg Ser Gln Glu Ser Pro Asn Thr Pro Ala Pro Pro
305                 310                 315                 320

Leu Ser Met Leu Arg Ile Lys Ser Glu Asn Gly Glu Gln Ala Phe Leu
                325                 330                 335

Leu Met Met Gln Pro Asp Asn Thr Ile Gly Asp Val Arg Ala Leu Leu
            340                 345                 350

Ala Gln Ala Arg Val Met Asp Ala Ser Ala Phe Glu Ile Phe Ser Thr
        355                 360                 365

Phe Pro Pro Thr Leu Tyr Gln Asp Asp Thr Leu Thr Leu Gln Ala Ala
    370                 375                 380

Gly Leu Val Pro Lys Ala Ala Leu Leu Leu Arg Ala Arg Arg Ala Pro
385                 390                 395                 400

Lys Ser Ser Leu Lys Phe Ser Pro Gly Pro Cys Pro Gly Pro Gly Pro
                405                 410                 415

Gly Pro Ser Pro Gly Pro Gly Pro Gly Ser Ser Pro Cys Pro Gly Pro
            420                 425                 430

Ser Pro Ser Pro Gln
        435



<210> 3 

<211> 60 

<212> DNA 

<213> Homo sapiens 
<400> 3

ttcagcacat tcccgcccac cctctaccag gacgatacac tcacgctgca ggctgcaggc 60 

<210> 4 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Phe Ser Thr Phe Pro Pro Thr Leu Tyr Gln Asp Asp Thr Leu Thr Leu
1               5                   10                  15

Gln Ala Ala Gly
            20



<210> 5 

<211> 42 

<212> DNA 

<213> Homo sapiens 

<400> 5 

accctctacc aggacgatac actcacgctg caggctgcag gc 42 

<210> 6 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 6

Thr Leu Tyr Gln Asp Asp Thr Leu Thr Leu Gln Ala Ala Gly
1               5                   10

<210> 7 

<211> 1028 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ctcagtgact atggcctgca gtgggtgggc gagcccatgg accaggagga ctcagagagc 60 

aagacagtct cagagcatgg cgagagggac tggatgacag ccaagaagtt ctggaagcca 120 

ggggactcat tggcgccccc tgaggtggac tttgacaggc tgctggccag cctgcaggat 180 

cttagtgagc tggtggtaga gggtgacacc caagtgacac cagtgcccgg cggggcacgg 240 

ctgcgtaccc tcgagcccat cccgctgaag ctctaccgga atggcatcat gatgttcgac 300 

gggcccttcc agcccttcta cgatccctcc acacagcgct gcctccgaga catattggat 360 

ggcttctttc cctcagagct ccagcgactg taccccaatg gggtcccctt taaggtgagt 420 

gacttgcgca atcaggtcta cctggaggat ggactggacc ccttcccagg cgagggccgt 480 

gtggtgggca ggcagcggat gcacaaggcc ttggacaggg tggaggagca cccaggctcc 540 

aggatgactg ctgagaaatt tctgaacagg ctccccaagt tttgatccgg caaggcgagg 600 

tgattgacat ccggggcccc atcagggaca ccttgcagaa ctgctgccca ttgcctgccc 660 

ggatccagga gactgtggtg gagacgccca ccttggccgc tgagcgagag aggagccagg 720

agtcacccaa cacaccggca cccccgctct ccatgctgcg catcaagtct gagaatgggg 780

aacaggcctt cctactgatg atgcagcctg acaacaccat tggggacgtg cgagctctgc 840

tagcgcaggc cagggtcatg gatgcctctg cctttgagat cttcagcaca ttcccgccca 900

ccctctacca ggacgataca ctcacgctgc aggctgcagg ccttgtgccc aaagcagcac 960

tgctgctgcg ggcacgccga gccccgaagt ccagcctgaa attcagtcct ggtccctgtc 1020

ccggtccc 1028



<210> 8 

<211> 343 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Leu Ser Asp Tyr Gly Leu Gln Trp Val Gly Glu Pro Met Asp Gln Glu
1               5                   10                  15

Asp Ser Glu Ser Lys Thr Val Ser Glu His Gly Glu Arg Asp Trp Met
            20                  25                  30

Thr Ala Lys Lys Phe Trp Lys Pro Gly Asp Ser Leu Ala Pro Pro Glu
        35                  40                  45

Val Asp Phe Asp Arg Leu Leu Ala Ser Leu Gln Asp Leu Ser Glu Leu
    50                  55                  60

Val Val Glu Gly Asp Thr Gln Val Thr Pro Val Pro Gly Gly Ala Arg
65                  70                  75                  80

Leu Arg Thr Leu Glu Pro Ile Pro Leu Lys Leu Tyr Arg Asn Gly Ile
                85                  90                  95

Met Met Phe Asp Gly Pro Phe Gln Pro Phe Tyr Asp Pro Ser Thr Gln
            100                 105                 110

Arg Cys Leu Arg Asp Ile Leu Asp Gly Phe Phe Pro Ser Glu Leu Gln
        115                 120                 125

Arg Leu Tyr Pro Asn Gly Val Pro Phe Lys Val Ser Asp Leu Arg Asn
    130                 135                 140

Gln Val Tyr Leu Glu Asp Gly Leu Asp Pro Phe Pro Gly Glu Gly Arg
145                 150                 155                 160

Val Val Gly Arg Gln Arg Met His Lys Ala Leu Asp Arg Val Glu Glu
                165                 170                 175

His Pro Gly Ser Arg Met Thr Ala Glu Lys Phe Leu Asn Arg Leu Pro
            180                 185                 190

Lys Phe Val Ile Arg Gln Gly Glu Val Ile Asp Ile Arg Gly Pro Ile
        195                 200                 205

Arg Asp Thr Leu Gln Asn Cys Cys Pro Leu Pro Ala Arg Ile Gln Glu
    210                 215                 220

Ile Val Val Glu Thr Pro Thr Leu Ala Ala Glu Arg Glu Arg Ser Gln
225                 230                 235                 240

Glu Ser Pro Asn Thr Pro Ala Pro Pro Leu Ser Met Leu Arg Ile Lys
                245                 250                 255

Ser Glu Asn Gly Glu Gln Ala Phe Leu Leu Met Met Gln Pro Asp Asn
            260                 265                 270

Thr Ile Gly Asp Val Arg Ala Leu Leu Ala Gln Ala Arg Val Met Asp
        275                 280                 285

Ala Ser Ala Phe Glu Ile Phe Ser Thr Phe Pro Pro Thr Leu Tyr Gln
    290                 295                 300

Asp Asp Thr Leu Thr Leu Gln Ala Ala Gly Leu Val Pro Lys Ala Ala
305                 310                 315                 320

Leu Leu Leu Arg Ala Arg Arg Ala Pro Lys Ser Ser Leu Lys Phe Ser
                325                 330                 335

Pro Gly Pro Cys Pro Gly Pro
            340



<210> 9 

<211> 6 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Phe Ser Thr Phe Pro Pro
1               5



<210> 10 

<211> 6 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Leu Val Pro Lys Ala Ala 
1 5 



<210> 11 

<211> 294 

<212> DNA 

<213> Homo sapiens 

<400> 11

ggggacgtgc gagctctgct agcgcaggcc agggtcatgg atgcctctgc ctttgagatc 60

ttcagcacat tcccgcccac cctctaccag gacgatacac tcacgctgca ggctgcaggc 120

cttgtgccca aagcagcact gctgctgcgg gcacgccgag ccccgaagtc cagcctgaaa 180

ttcagtcctg gtccctgtcc cggtcccggt cccggcccca gtcccggtcc cggtcccggc 240

tccagtccct gtcccggacc cagtcccagc ccccaataaa gcacccaccc cctc 294



<210> 12 

<211> 92 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Gly Asp Val Arg Ala Leu Leu Ala Gln Ala Arg Val Met Asp Ala Ser
1               5                   10                  15

Ala Phe Glu Ile Phe Ser Thr Phe Pro Pro Thr Leu Tyr Gln Asp Asp
            20                  25                  30

Thr Leu Thr Leu Gln Ala Ala Gly Leu Val Pro Lys Ala Ala Leu Leu
        35                  40                  45

Leu Arg Ala Arg Arg Ala Pro Lys Ser Ser Leu Lys Phe Ser Pro Gly
    50                  55                  60

Pro Cys Pro Gly Pro Gly Pro Gly Pro Ser Pro Gly Pro Gly Pro Gly
65                  70                  75                  80

Ser Ser Pro Cys Pro Gly Pro Ser Pro Ser Pro Gln
                85                  90



<210> 13 

<211> 19 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> PCR primer sequence

<400> 13 

tccagcatgg tgtgtctga 19 



<210> 14 

<211> 18 

<212> DNA 

<213> artificial 

<220> 

<223> PCR primer sequence 

<400> 14 

ccttgaatgt ggtcatct 18 
<210> 15

<211> 23 

<212> DNA 

<213> artificial 

<220> 

<223> PCR primer sequence 

<400> 15

cgtttcttgg agtactctac gtc 23 



<210> 16 

<211> 20 

<212> DNA 

<213> artificial 



<220> 

<223> PCR primer sequence 
<400> 16 

ccaccgcggc ccgctcgtct 20 



<210> 17 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Phe Ser Thr Phe Pro Pro Thr Leu Tyr Gln Asp Asp Thr Leu Thr Leu
1               5                   10                  15

Gln Ala Ala Gly
            20



<210> 18 

<211> 20

<212> PRT 

<213> Homo sapiens 

<400> 18 

Thr Leu Tyr Gln Asp Asp Thr Leu Thr Leu Gln Ala Ala Gly Leu Val
1               5                   10                  15

Pro Lys Ala Ala
            20



<210> 19 

<211> 1771 

<212> DNA 

<213> Homo sapiens 

<400> 19 

aaaaaaccgc gtgacaacaa gatggcggcg ctgcgggacg gctagcggcc ctgcgtgtac 60 

tttcccaagc accaccaggc caaaggtctc tcagttcaga gcagaaagcc gtatacccag 120 

aggagcaggc agataacaga aacttccaga aacctctgtg gagacagtgg aagaggcaaa 180 

agggagttcc tgacagctgg attctagaag tagaactatg agctcacctt tggcctccct 240 

tagcaagacc cgaaaagtgc ccctgccctc ggagcctatg aatcctggga ggcgaggaat 300

ccgcatctat ggagatgaag atgaggtgga catgttgagt gatgggtgtg gctcggaaga 360

aaagatctca gtcccttcct gctatggcgg cataggtgcc cctgtgagtc ggcaagtccc 420

tgcatcccat gactcggagc tgatggcctt catgacgagg aagttgtggg acctggagca 480

gcaggtgaag gcccagactg atgagatact gtccaaggat cagaagatag cggccctaga 540

ggacctggtg cagaccctcc ggccacaccc agccgaggca accctgcagc ggcaggagga 600

actggagacg atgtgtgtgc agctgcagcg gcaggtcagg gagatggagc ggttcctcag 660

tgactatggc ctgcagtggg tgggcgagcc catggaccag gaggactcag agagcaagac 720

agtctcagag catggcgaga gggactggat gacagccaag aagttctgga agccagggga 780

ctcattggcg ccccctgagg tggactttga caggctgctg gccagcctgc aggatcttag 840

tgagctggtg gtagagggtg acacccaagt gacaccagtg cccggcgggg cacggctgcg 900

taccctcgag cccatcccgc tgaagctcta ccggaatggc atcatgatgt tcgacgggcc 960

cttccagccc ttctacgatc cctccacaca gcgctgcctc cgagacatat tggatggctt 1020

ctttccctca gagctccagc gactgtaccc caatggggtc ccctttaagg tgagtgactt 1080

gcgcaatcag gtctacctgg aggatggact ggaccccttc ccaggcgagg gccgtgtggt 1140

gggcaggcag cggatgcaca aggccttgga cagggtggag gagcacccag gctccaggat 1200

gactgctgag aaatttctga acaggctccc caagtttgtg atccggcaag gcgaggtgat 1260

tgacatccgg ggccccatca gggacacctt gcagaactgc tgcccattgc ctgcccggat 1320

ccaggagatt gtggtggaga cgcccacctt ggccgctgag cgagagagga gccaggagtc 1380

acccaacaca ccggcacccc cgctctccat gctgcgcatc aagtctgaga atggggaaca 1440

ggccttccta ctgatgatgc agcctgacaa caccattggg gacgtgcgag ctctgctagc 1500

gcaggccagg gtcatggatg cctctgcctt tgagatcttc agcacattcc cgcccaccct 1560

ctaccaggac gatacactca            tgcaggcctt gtgcccaaag cagcactgct 1620

gctgcgggca cgccgagccc cgaagtccag cctgaaattc agtcctggtc cctgtcccgg 1680

tcccggtccc ggccccagtc ccggtcccgg tcccggctcc agtccctgtc ccggacccag 1740

tcccagcccc caataaagca cccgccccct c 1771



<210> 20 

<211> 512 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Met Ser Ser Pro Leu Ala Ser Leu Ser Lys Thr Arg Lys Val Pro Leu
1               5                   10                  15

Pro Ser Glu Pro Met Asn Pro Gly Arg Arg Gly Ile Arg Ile Tyr Gly
            20                  25                  30

Asp Glu Asp Glu Val Asp Met Leu Ser Asp Gly Cys Gly Ser Glu Glu
        35                  40                  45

Lys Ile Ser Val Pro Ser Cys Tyr Gly Gly Ile Gly Ala Pro Val Ser
    50                  55                  60

Arg Gln Val Pro Ala Ser His Asp Ser Glu Leu Met Ala Phe Met Thr
65                  70                  75                  80

Arg Lys Leu Trp Asp Leu Glu Gln Gln Val Lys Ala Gln Thr Asp Glu
                85                  90                  95

Ile Leu Ser Lys Asp Gln Lys Ile Ala Ala Leu Glu Asp Leu Val Gln
            100                 105                 110

Thr Leu Arg Pro His Pro Ala Glu Ala Thr Leu Gln Arg Gln Glu Glu
        115                 120                 125

Leu Glu Thr Met Cys Val Gln Leu Gln Arg Gln Val Arg Glu Met Glu
    130                 135                 140

Arg Phe Leu Ser Asp Tyr Gly Leu Gln Trp Val Gly Glu Pro Met Asp
145                 150                 155                 160

Gln Glu Asp Ser Glu Ser Lys Thr Val Ser Glu His Gly Glu Arg Asp
                165                 170                 175

Trp Met Thr Ala Lys Lys Phe Trp Lys Pro Gly Asp Ser Leu Ala Pro
            180                 185                 190

Pro Glu Val Asp Phe Asp Arg Leu Leu Ala Ser Leu Gln Asp Leu Ser
        195                 200                 205

Glu Leu Val Val Glu Gly Asp Thr Gln Val Thr Pro Val Pro Gly Gly
    210                 215                 220

Ala Arg Leu Arg Thr Leu Glu Pro Ile Pro Leu Lys Leu Tyr Arg Asn
225                 230                 235                 240

Gly Ile Met Met Phe Asp Gly Pro Phe Gln Pro Phe Tyr Asp Pro Ser
                245                 250                 255

Thr Gln Arg Cys Leu Arg Asp Ile Leu Asp Gly Phe Phe Pro Ser Glu
            260                 265                 270

Leu Gln Arg Leu Tyr Pro Asn Gly Val Pro Phe Lys Val Ser Asp Leu
        275                 280                 285

Arg Asn Gln Val Tyr Leu Glu Asp Gly Leu Asp Pro Phe Pro Gly Glu
    290                 295                 300

Gly Arg Val Val Gly Arg Gln Arg Met His Lys Ala Leu Asp Arg Val
305                 310                 315                 320

Glu Glu His Pro Gly Ser Arg Met Thr Ala Glu Lys Phe Leu Asn Arg
                325                 330                 335

Leu Pro Lys Phe Val Ile Arg Gln Gly Glu Val Ile Asp Ile Arg Gly
            340                 345                 350

Pro Ile Arg Asp Thr Leu Gln Asn Cys Cys Pro Leu Pro Ala Arg Ile
        355                 360                 365

Gln Glu Ile Val Val Glu Thr Pro Thr Leu Ala Ala Glu Arg Glu Arg
    370                 375                 380

Ser Gln Glu Ser Pro Asn Thr Pro Ala Pro Pro Leu Ser Met Leu Arg
385                 390                 395                 400

Ile Lys Ser Glu Asn Gly Glu Gln Ala Phe Leu Leu Met Met Gln Pro
                405                 410                 415

Asp Asn Thr Ile Gly Asp Val Arg Ala Leu Leu Ala Gln Ala Arg Val
            420                 425                 430

Met Asp Ala Ser Ala Phe Glu Ile Phe Ser Thr Phe Pro Pro Thr Leu
        435                 440                 445

Tyr Gln Asp Asp Thr Leu Thr Leu Gln Ala Ala Gly Leu Val Pro Lys
    450                 455                 460

Ala Ala Leu Leu Leu Arg Ala Arg Arg Ala Pro Lys Ser Ser Leu Lys
465                 470                 475                 480

Phe Ser Pro Gly Pro Cys Pro Gly Pro Gly Pro Gly Pro Ser Pro Gly
                485                 490                 495

Pro Gly Pro Gly Ser Ser Pro Cys Pro Gly Pro Ser Pro Ser Pro Gln
            500                 505                 510